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RELATIONSHIP OF SELF-ACCEPTANCE TO OTHER VARIABLES 
WITH SIXTH GRADE CHILDREN ORIENTED IN 
SELF-UNDERSTANDING! 

PAUL BRUCE* 

Child Welfare Research Station, State University of lowa 


The purpose of this study was to in- 
vestigate with sixth grade children some 
of the relationships between a measure of 
self-acceptance and other personality var- 
iables. The relationships were studied in 
two groups of Ss: one consisting of pupils 
who had taken part in a learning program 
designed to develop a more understanding 
and analytical approach to their own and 
others’ behavior (the experimental group) 
and one group consisting of pupils who 
had not undertaken such a program (the 
control group). 

A review of the literature—particularly 
the writings of Rogers (15) and other 
researchers with a client-centered therapy 
orientation—suggested that  self-accept- 
ance be defined as the congruence between 
the way a person thinks himself to be 
(self-concept) and the way he thinks he 
would most like to be (ideal-self). Ac- 
cordingly, in this study, a measuring in- 
strument from which Self-Ideal Diserep- 
ancy scores were obtained was devised 
suitable to a sixth grade population. 

Within the framework of a self-ideal 
discrepancy measure of self-acceptance, a 


*This paper is based on a doctoral dis- 
sertation submitted to the Graduate College 
of the State University of Iowa in 1957. The 
writer is indebted to Ralph H. Ojemann 
for his encouragement and guidance through- 
out the study and to the members of the 
Preventive Psychiatry Project staff for their 
invaluable assistance. 

* Now at San Diego State College. 


child’s indication of his ideal-self can be 
viewed as his expression of how he feels 
some of his psychological needs (such as 
the needs for security, personal worth, 
status, ete.) can be best satisfied. In 
other words, underlying the conception 
of ideal-self is the assumption by the child 
that if he becomes more like his ideal- 
self, he will be better able to satisfy some 
of his secondary needs. For example, im- 
plicit in a child’s wanting “to be better 
looking,” “to have more friends,” “to be 
less sensitive,” etc., is the feeling that one’s 
affectional, security, and status needs 
(among others) would be better satisfied 
if these ideals were achieved. If a child, 
then, feels he is quite unlike his ideals or 
feels he is not making progress towards 
his ideals, it might be speculated that ade- 
quate satisfaction of several of his sec- 
ondary needs is being blocked in some 
way, and thus several predictions concern- 
ing his behavior might be made. 

It would be expected that a child who 
indicates a marked discrepancy between 
his self-concept and ideal-self, feeling his 
need satisfaction blocked, would show evi- 
dences of insecurity in his behavior and 
would yield responses indicating manifest 
anxiety. Such relationships between meas- 
ures of self-acceptance and various meas- 
ures of insecurity and anxiety have been 
obtained with subjects of college age and 
beyond (1, 2, 7, 13, 16). One of the 
major purposes of this investigation was 
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to see if the relationship between self- 
ideal discrepancy (as a measure of self- 
acceptance) and measures of observed 
insecurity and manifest anxiety held for 
a group of sixth grade children. 

The orientation program which the ex- 
perimental group of Ss had undertaken 
was designed to help each pupil develop 
a “causal,” analytical orientation towards 
his social environment. The program was 
based on the assumption that a child who 
is provided with the opportunity for 
understanding some of the many causes 
underlying his own behavior and the be- 
havior of people about him will be able 
to make more effective adjustments. In- 
volved in the program was the special 
training of the teachers and the use of cer- 
tain special curricular materials, the de- 
tails of which have been published else- 
where (9, 11, 12). 

With respect to this investigation, the 
orientation program provided a group of 
Ss who had been trained in self-under- 
standing, among other things. Several 
writers, such as Jersild (6) and Rogers 
(14), have indicated that an important 
characteristic of a self-accepting person 
is self-understanding. The question arises, 
then, as to whether a child with increased 
self-understanding is thereby more self- 
accepting. It might be hypothesized that 
as a child gains insight into his own mo- 
tivations and dynamics—that is, gains in 
self-understanding—he would give evi- 
dence of being more self-accepting. Also 
to the extent that the orientation pro- 
gram helps the individual learn how to 
work out his daily situations more ef- 
fectively, he may be helped to make 
progress towards his ideal thereby re- 
ducing the self-ideal discrepancies. 

On the other hand, increased self- 
understanding may not affect self-accept- 
ance, particularly when self-acceptance is 
defined in terms of self-ideal discrepancy. 
For example, an individual may see him- 
self some distance from his ideal, but with 
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increased self-understanding, understand 
some of the reasons for this situation. In 
such a case, self-ideal discrepancy might 
remain high even after the orientation 
program, although the individual would 
feel better—that is, be less anxious—about 
it. Thus, the orientation program might 
serve to reduce anxieties which normally 
accompany high self-ideal discrepancy 
without necessarily reducing the discrep- 
ancy itself. It appears, therefore, that the 
orientation program could have diverse 
effects upon a self-ideal discrepancy meas- 
ure of self-acceptance. 

The specific questions to which answers 
were sought in this study are: (a) Is there 
a significant difference between Ss with 
high self-ideal discrepancy and those with 
low self-ideal discrepancy in average scores 
on a test of manifest anxiety? (b) Does 
this relationship hold for Ss who have 
taken part in an orientation program in 
self-understanding? (c) Is there a signifi- 
cant difference between Ss with high self- 
ideal discrepancy and those with low self- 
ideal discrepancy in average ratings on an 
observation scale of insecurity? 


PROCEDURE 


Measures used in study. A Self-Accept- 
ance Scale was devised and initially ad- 
ministered to several sixth grade classes. 
The results and items were analyzed quali- 
tatively and quantitatively. The 10 items 
finally selected for the revised Self-Ac- 
ceptance Scale, in general, reflected af- 
fective characteristics about which indi- 
viduals in our culture were thought to have 
substantial feelings. (Only nine of these 
items were scored since one proved am- 
biguous in subsequent administrations of 
the seale.) A more detailed description 
and a copy of this scale appears elsewhere 
(3). 

Examples of typical items are these: 
“1. This is someone who feels that others 
don’t like him or her—someone whom 
nobody seems to care about;” and “8. 
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This person is happy and cheerful most 
of the time—one who seems to enjoy what 
he or she does.” Some of the items in the 
scale were adapted from the Social Analy- 
sis of the Classroom Inventory developed 
by the Horace Mann-Lincoln Institute of 
School Experimentation (5). Descriptive 
statements were utilized in place of the 
trait-names (used on most other scales 
of this type) in order to reduce the amount 
of ambiguity. 

The Ss were asked to answer four ques- 
tions about each of the descriptive state- 
ments: 


1. Would most boys and girls my age like 
to be like this person? (This question was 
not scored but was included as a check on 
the general favorability or unfavorability of 
each item.) 

2. Am I like this person? (Self-Concept) 

3. Am I becoming more like this person? 
(The results obtained by the use of this 
question were inconclusive and are not dis- 
cussed in this paper.) 

4. Would I like to be like this person? 
(Ideal-Self) 


The S answered each question indicating 
to what extent he felt it was true by 
checking along a 5-point scale reading: 
“very much so,” “quite a bit,” “some- 
what,” “not very much,” and “not at all.” 

The particular score yielded by this 
scale which was used in the major analyses 
reported in this paper is the Self-Ideal 
Diserepancy score. This score is the dis- 
crepancy between the rating in answer 
to Question 2 (Self-Concept) and the 
rating in answer to Question 4 (Ideal- 
Self) summed for the nine items. Test- 
retest reliability for this score is reported 
in the next section. 

The other measures used in this study 
included the Children’s Manifest Anxiety 
Seale developed by Castaneda et al. (4) 
and the Kooker Security-Insecurity Rating 
Seale. The Children’s Manifest Anxiety 
Seale consists of 42 anxiety items which 
are answered “yes” or “no.” The Kooker 
seale (8) gives evidence of children’s be- 
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havior which is independent of their re- 
sponses on a paper-pencil type test and 
which reflects their “typical” behavior pat- 
terns. This scale requires a trained observer 
who is familiar with the child’s behavior to 
rate him on a series of 19 behavior items, 
as occurring “frequently,” “fairly often,” 
or “seldom.” In the present study, each 
classroom was visited for a five-day period 
and the children rated by one of the two 
trained observers (who remained ignorant 
of the nature of this investigation). Both 
observers practiced rating the same Ss 
with the Kooker scale until they were in 
substantial agreement with each other. 
With the exception of the Kooker scale, 
the other measures were administered to 
all classes by the investigator. The teachers 
were not involved in the measurement as- 
pects of the investigtion and were not 
aware of the nature of the study. 
Subjects of investigation. The subjects 
in this study were pupils of eight sixth 
grade classes in different elementary 


schools located in comparable neighbor- 


hoods with respect to socioeconomic status. 
The four experimental classes had under- 
gone the orientation program described 
above. Most of the pupils in two of these 
experimental classes had been in the pro- 
gram for two consecutive years; whereas 
the pupils in the remaining two experi- 
mental classes had undergone the orienta- 
tion program for the current year only. 

At the beginning of the program, four 
control classes were selected controlling 
for some of the teacher variability. Thus, 
for every teacher of an experimental class, 
a control teacher was selected and matched 
on the basis of several variables including 
age, sex, number of years teaching experi- 
ence, and educational level. Unlike the 
experimental teachers, the control teach- 
ers did not have special training for this 
program and did not carry on the planned 
learning program in their classes. 

With respect to the Ss themselves, in 
the experimental classes 50 boys and 48 
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girls completed all of the testing involved 
in this study, while in the control classes 
tests were completed by 40 boys and 46 
girls. The average 1Q for the experimental 
classes was 105.9 and 105.4 for the con- 
trol classes as measured by the Otis Self- 
Administering Test of Mental Ability. As 
far as could be determined from school 
officials, no systematic method was used 
to assign the pupils to particular classes 
or to teachers (except for the two-year 
experimental group who were kept to- 
gether the second year specifically for the 
orientation program.) Thus, with the Ss 
coming from comparable home back- 
grounds, being randomly placed in their 
particular class groups, and being undif- 
ferentiated with respect to average in- 
telligence scores, the pupils in all eight 
classes were assumed to be originally a 
random sampling of a population of such 
pupils in their particular schools. 
Statistical analyses. To test two major 
problems of this investigation—that of 
establishing significant relationships be- 
tween self-ideal discrepancy and measures 
of manifest anxiety and observed insecur- 
ity—a three-dimensional (2 X 2 X 2) 
analysis of variance design was used (10). 
The factors controlled in this design were 
sex, the experimental-control condition, 
and self-acceptance. The Ss were divided 
into two groups according to their scores 
on the Self-Acceptance Scale—one group 
(the Highs) made up of those whose Self- 


TABLE 1 


Test-Rerest (ONE Week INTERVAL) 
RELIABILITY CORRELATION COEFFI- 
CIENTS OF THE SEVERAL SCORES OF THE 
Se.r-AccepTANCE SCALE FoR Eacu oF 
THE Two C.iasses StupIEp 








Self- Ideal- 
Concept Self 





-69 
54 


.83 
-93 
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Ideal Discrepancy scores fell above the 
median; the other group (the Lows) with 
those whose Discrepancy scores fell be- 
low the median. This median was com- 
puted using all 184 of the main group of 
Ss. In order to have equal frequencies in 
each cell of this design, 20 Ss were ran- 
domly selected from each group which 
represented a different combination of the 
factors being controlled; thus, there were 
160 Ss used in the analysis of variance 
tests. The dependent variables in the 
analyses of variance were the scores from 
the Children’s Manifest Anxiety Seale 
and the Kooker Security-Insecurity Rat- 
ing Scale. 

The possible effects of the orientation 
program on the relationship between self- 
ideal discrepancy and anxiety was tested 
by t tests of a difference in means between 
the experimental and control groups. 


RESULTS 


Results of preliminary testing. Test-re- 
test reliability of the Self-Acceptance 
Scale was determined by administering this 
test to two classes of sixth graders on two 
occasions, one week apart. Coefficients of 
correlation for the two administrations 
for the various sections of the scale are 
shown in Table 1. 

Although all of these coefficients are 
significant beyond the one per cent level 
of confidence, it will be noted that those 
for the Ideal-Self scores are somewhat 
lower than the others. This can be at- 
tributed to the limited range characteris- 
tie of this particular score—that is, the 
pupils tended not to vary from one an- 
other in the rating of their ideal-self con- 
cepts—thus, the correlation test was par- 
ticularly sensitive even to the smallest 
deviations from one administration to the 
next. 

In order to test whether or not intelli- 
gence was likely to be an important factor 
in the self-acceptance score, IQ scores 
from the Otis Self-Administering Test of 
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Mental Ability were secured from school 
records for all pupils who participated in 
the testing program. The coefficient of 
correlation between the Self-Ideal Dis- 
crepancy scores and IQ scores for the 47 
pupils participating in the preliminary 
study was —.08 indicating intelligence was 
not a factor. 

Analyses using Self-Acceptance Scale. 
Tables 2 and 3 show the results of analy- 
sis of variance tests using mean scores 
from the Children’s Manifest Anxiety 
Seale and the Kooker Security-Insecurity 
tating Scale, respectively, when the Ss 
were divided into two groups, Highs and 
Lows, according to whether they fell 
above or below the median Self-Ideal Dis- 
crepancy score. These tables show that 
the differences in the means on both the 
anxiety and insecurity scales between the 
Highs and Lows were statistically signifi- 
cant beyond the .01 level of confidence. 
Furthermore, these differences were in the 
direction such that the group with the 
relatively high self-acceptance (that is, 
low discrepancy scores) yielded average 
scores indicating less manifest anxiety, as 
measured, and less insecurity, as rated by 
observers. 

One qualification to these findings should 
be noted. In the analysis of variance tests, 
the interaction effects were not significant 
at the .05 level of confidence (the level 
prescribed prior to the investigation) ; 
however there was a tendency for an in- 
teraction to exist between the Self-Ideal 
Discrepancy scores and the experimental- 
control condition. (The F test of this in- 
teraction between the discrepancy scores 
and the experimental-control condition 
produced ratios of 3.14 and 3.29 for the 
Anxiety and Kooker scores, respectively, 
which are significant between the .10 and 
05 levels of confidence.) This tendency 
towards interaction indicates that the dis- 
crepancy measure of self-acceptance was 
related to the personality variables of 
anxiety and insecurity differentially ac- 
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TABLE 2 
MEANS AND ANALYSIS OF VARIANCE OF 
Scores ON THE CHILDREN’S MANIFEST 
ANx1IETY ScALE or Suspsects DivipEep 
inTo “Higus”’ anp “Lows’’ AccoRDING 
To Tuetr Sevr-Ipeat DiscREPANCY 
Scores ON THE SE LF- 
ACCEPTANCE SCALE 
(N = 20 1n Eacn Sex, Cuiass- 
TYPE GROUPING) 


| Children’s Manifest | 
Anxiety Scale 


i 

Self-Ideal Discrepancy |~ 
core 

Boys | Girls 


| | 
| Total F 


Highs 

« ; 
Lows | 
Highs 16.78 

Exptl. Classes (12.10 18.55 

Control Classes |17.05 19.40 
|__| 13 . 40** 
Lows ee 12.56 

Exptl. Classes |13.05)13.25, 

Control Classes i .70 12.25 








Total (Boys 13.48 15.86 
& Girls) | 





F | 4.30% | 


* Significant beyond the .05 level of confidence. 

** Significant beyond the .01 level of confidence. 
cording to whether or not the Ss were in 
the orientation program. The implications 
of this possible interaction will be dis- 
cussed below. 

The results reported thus far have indi- 
cated relationship between self-accept- 
ance, as measured, and the measures of 
manifest anxiety and observed insecurity. 
So as to obtain further information con- 
cerning the extent of this relationship, a 
correlational analysis of the major vari- 
ables was made, and the results are re- 
ported in Table 4. The coefficients of cor- 
relation for Self-Ideal Discrepancy scores 
with the Anxiety scores and Kooker scores 
were each significant beyond the .01 level 
of confidence. 

Effects of the orientation program. Anal- 
ysis of the data indicating the effects of the 
orientation program on the variables meas- 
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TABLE 3 
MEANS AND ANALYSIS OF VARIANCE OF 
Scores oN THE Kooker SeEcurity- 
Insecurity ScaLe or Sussects Dt- 
VIDED into “Hicus” anp “Lows’’ 
Accorpinc To THER Sevr-IpEaL 
DiscrREPANCY SCORES ON THE 
Se.r-AccEPTANCE ScALE 
(N = 20 1n Eacu Sex, Cuiass- 
TYPE GROUPING) 








Kooker Security- 
Insecurity 





Self-Ideal Discrepancy 
Score Total 
ws 


Lows 


Boys | Girls 





Highs 
Exptl. Classes 


25.10 
9025.65 
20/24 .64 


| 
Lows 


24 
Control Classes ([25. 





Exptl. Classes 
Control Classes 


22. 10/21.60 
24.75)22.80 








Total (Boys & 


24..24:23.68 


Girls) 











F <1 





* Significant beyond the .01 level of confidence 


TABLE 4 
COEFFICIENTS OF CORRELATION BETWEEN 
THE Seir-IpEaL Discrepancy Score oF 
THE SELF-ACCEPTANCE SCALE AND THE 
MEASURES OF ANXIETY AND 
INSECURITY 








Self-Ideal Discrepancy 





Girls | Total 


Boys | 
woste:, amine 


N= 





Manifest Anxiety .28* | .41° | .35* 
Scale 


Kooker Rating Scale | .33* | .34* | .32* 








* Significant beyond the .01 level of confidence 


ured appears in Table 5. Separation of the 
experimental classes made possible a com- 
parison of the effects between exposure to 
the program for one year and for two 
years. It will be noted that significant dif- 
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ferences on the measures of manifest anx- 
iety and observed insecurity appear be- 
tween the classes having had the program 
for two years and each of the other two 
groups involved—those having had the 
program for one year and those in the con- 
trol classes. These differences are in a di- 
rection which indicates that at least two 
years’ exposure to the orientation pro- 
gram may serve to lower manifest anxiety 
and reduce observed insecurity. Signifi- 
cant differences between the two-year ex- 
perimental group and the one-year experi- 
mental and control groups were not found 
with the Self-Ideal Discrepancy scores. 

An interesting question is, how do the 
Highs and the Lows with respect to Self- 
Ideal Discrepancy in the two-year experi- 
mental group compare with those in the 
control group? This analysis is given in 
Table 6. 

Inspection of this table shows that there 
is a difference in means between the High 
two-year experimental group and the con- 
trol group in manifest anxiety which is 
significant beyond the .01 level of confi- 
dence. In other words, it appears that 
those in the two-year experimental group 
who retained a relatively high Self-Ideal 
Discrepancy appeared to indicate less 
manifest anxiety (as measured) than did 
the control Ss who also had a relatively 
high Self-Ideal Discrepancy. This lends 
some support to the contention that at 
least a two year exposure to the orientation 
program may allow individuals to feel 
more comfortable about discrepancies 
which they feel exist between their self- 
concepts and ideal-self concepts. 

Inspection of Table 6 also reveals that 
the difference between the Highs and Lows 
in manifest anxiety in this two-year ex- 
perimental group is not significant. In 
other words, it appears that the major re- 
sults concerning the relationship between 
Self-Ideal Discrepancy scores and mani- 
fest anxiety should be qualified. High Self- 
Ideal Discrepancy scores may be associated 
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TABLE 5 
MEANS AND ¢ TESTS BETWEEN ONE AND Two YEAR EXPERIMENTAL GROUPS AND 
BETWEEN THE Two YEAR EXPERIMENTAL Group AND ConTrRoL Group ON MEAs- 
URES oF SELF-ACCEPTANCE, ANXIETY, AND INSECURITY 








y | 


set Ideal | Discrepancy, Manifest Anxiety Scale 


Kooker Rating Scale 





. | ¥ 


xX xX ‘ 





ee wes 


53 


One- Tour Exptl. Group 7.98 


| 
Two-Year Exptl. Group | 45 | 7.16 





7.88 





Control Group 86 | 


24.03 

















* Significant beyond the .05 level of confidence. 


TABLE 6 
MEANS AND ¢t TESTS ON THE CHILDREN’S 
ManNtrest ANXIETY SCALE BETWEEN 
THE Two YEAR EXPERIMENTAL AND 
ConTROL GROUPS AND BETWEEN THE 
Hicus anp Lows IN Sevr-IpEaAL 
Discrepancy Scores 





| Children's Manifest Anxiety Scale 





Dif- 
fer- ‘ 
ence 





Two-Year | 11.88 | 12.04 [0.16 
Exptl. (N = 16|\(N = 26) 
Group 








18.22 | 11.98 |6.24/4. 


site: \(N = 40) 


Control 
Group (N = 








Difference 








| 
| 
| 


t 





* Significant beyond the .01 level of confidence. 


with high Manifest Anxiety scores only 
for Ss with insufficient self-understanding ; 
this relationship may not apply to Ss ex- 
periencing an orientation program in self- 
understanding for at least two years. This 
finding also may explain the tendency for 
interaction effects to exist between the 
Self-Ideal Discrepancy scores and the ex- 
perimental-control condition, particularly 
in the case of the Anxiety test analysis 
which was noted above. 


Discussion 


The validity of the Self-Acceptance 
Scale. The results reported above indicated 
significant relationships between the Self- 
Ideal Discrepancy measure of self-accept- 
ance and scores from the Children’s Mani- 
fest Anxiety Scale and the Kooker Se- 
curity-Insecurity Rating Scale. These find- 
ings corroborate with sixth-grade-children 
findings of various other investigators who 
used similar measures with older Ss. The 
validity of the Self-Acceptance Scale is 
further substantiated by the fact of its 
relationship to an observation measure 
(the Kooker scale) which was independent 
of the children’s paper-pencil responses. 

The relatively low correlation between 
the Kooker and the Anxiety scales (V = 
184, r = .26) may indicate that these two 
instruments are measuring different vari- 
ables, and the Self-Acceptance Scale may 
be tapping some of the variables distine- 
tive to each of these other two scales as is 
indicated by the following coefficients of 
correlation (NV = 184): (a) Correlation 
between Self-Ideal Discrepancy scores and 
Anxiety scores = 35; (b) Correlation be- 
tween Self-Ideal Discrepancy scores and 
Kooker scores = .32. 

The results of this investigation (par- 
ticularly the analyses of the effects of the 
orientation program to be discussed in 
more detail below) raise a serious question 
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as to the adequacy of the self-ideal con- 
gruence concept of self-acceptance. A dis- 
crepancy between self and ideal might well 
mean different things to different individ- 
uals. While to one person a self-ideal dis- 
crepancy might be a threat to his self sys- 
tem, to another such a discrepancy might 
indicate that his aspirations are high and 
serve as a challenge to him. What seems 
to be important is not the discrepancy it- 
self, but the feelings about it. Thus, as the 
discussion below will indicate, it isn’t nec- 
essarily the high discrepancy alone which 
is associated with anxiety but a high dis- 
crepancy under certain conditions. 

The effects of the orientation program. 
In the introductory discussion, it was 
pointed out on an a priori basis that the 
orientation program might be expected 
to have diverse effects on the behavior of 
children. Among the early effects of such 
a program may be the reduction of the 
anxiety which seems to accompany a 
high self-ideal discrepancy. Bearing this 
out was the tendency for interaction ef- 
fects to exist, particularly in the analyses 
of the Anxiety scores, between the Dis- 
crepancy scores and the experimental-con- 
trol condition. This indicates that the re- 
lationship between the Discrepancy scores 
and the measure of anxiety was not the 
same for the experimental classes as for 
the control classes. 

Further evidence indicating the nature 
of this tendency for interaction is obtained 
when analysis is made of the scores of 
those classes in which most of the pupils 
had had the orientation program for a 
two-year period. It will be recalled from 
Table 6 that those with high Self-Ideal 
Discrepancy scores in the two-year ex- 
perimental group had average Anxiety 
scores which were significantly lower than 
those in the control group. Thus support 
is given to the contention that at least a 
two year exposure to an orientation in 
self-understanding may allow individuals 
to feel more comfortable about discrepan- 








cies which they feel exist between their 
self-concepts and ideal-self concepts. 

Indication that a period longer than one 
year in the orientation program may be 
needed for measurable changes to occur 
in such variables as manifest anxiety and 
observed insecurity was seen in the re- 
sults reported in Table 5. It will be re- 
called that in this analysis, the two-year 
experimental group had average scores 
which indicated less manifest anxiety and 
less observed insecurity when comparisons 
were made with the one-year experimental 
group and the control group. Even though 
the differences of Self-Ideal Discrepancy 
scores between these groups were not sta- 
tistically significant, the discussion above 
indicated that the feelings about a high 
discrepancy, where it existed, may have 
been changed as a result of the two-year 
orientation program. 

Obviously, a single study is not sufficient 
to establish the validity of this finding. 
Several studies will be needed to check 
these results, but those from this investiga- 
tion—that is, the tendency toward inter- 
action effects noted in the analyses of the 
Self-Acceptance Scale and the study of the 
two-year experimental group—suggest that 
anxieties relative to high self-ideal dis- 
crepancies may be more prevalent for 
those with insufficient self-understanding 
than for those who have been trained in 
understanding themselves. 


SuMMARY 


The purpose of this investigation was to 
construct a measuring instrument which 
could be reliably used in the investigation 
of purported relationships between self- 
acceptance and other important personal- 
ity variables and in the study of the possi- 
ble effects a learning program concerned 
with the causes of behavior might have on 
the participants in such a program. 

The Self-Acceptance Scale, constructed 
to measure self-acceptance in sixth grade 
children, consisted of a series of descrip- 
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tive statements with which each child 
rated himself according to the extent he 
felt he was like the description (the self- 
concept) and to what extent he felt he 
wanted to be like the description (ideal- 
self). Self-acceptance was defined as the 
congruence (that is, relative lack of dis- 
crepancy) between self-concept and ideal- 
self. 

The Ss were 184 pupils in eight sixth 
grade classes in different elementary 
schools of a medium-sized city in Iowa. 
Four of the eight classes had undergone 
a planned learning program designed to 
help each pupil acquire an understanding 
of the dynamic, variable, and complex na- 
ture of human motivation. 

The other measures used in this investi- 
gation included the Children’s Manifest 
Anxiety Seale and the Kooker Security- 
Insecurity Scale (ratings made by trained 
observers in the classrooms). 

The results indicated a statistically sig- 
nificant relationship (beyond the .05 level) 
between self-acceptance, as measured by 
the Discrepancy scores, and the measures 
of manifest anxiety and observed insecur- 
ity such that those with the smaller Dis- 
crepancy scores had average scores indi- 
cating less anxiety and less insecurity. 
However, a tendency was noted for inter- 
action effects to be present indicating that 
this relationship between self-acceptance 
and measures of anxiety and insecurity 
might have been operating differentially 
between the experimental and control 
classes. 

When analyses were made of those ex- 
perimental classes in which most of the 
pupils had had the program for two con- 
secutive years, support was found for the 
contention that although Self-Ideal Dis- 
crepancy scores remained high for some 
of these pupils, the Anxiety scores were 
significantly lower than those of either the 
one-year experimental or the control 
classes. This finding suggested that the 
statement of relationship between self- 





ideal discrepancy and anxiety might have 
to be qualified to apply to Ss with insuffi- 
cient self-understanding. Also, the two- 
year experimental group obtained average 
scores which indicated less manifest anxi- 
ety and less observed insecurity than did 
the average scores obtained by either the 
one-year experimental group or control 
group. 
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An organism’s adjustment to a new 
situation may be scrutinized for the vari- 
ability or fixedness of the behavior em- 
ployed. Presumably, during the early 
stages of solving a novel problem, be- 
havior is characterized by its variability; 
as commerce with the problem increases 
and correct solutions are achieved, the 
resulting habits act to decrease subse- 
quent variability. In a recent statement, 
Seott (12, p. 61) points out that such 
a sequence is characteristic even of the 
simplest organism: 


Such variability can be seen even in the 
lower organisms. If a paramecium runs into 
an obstacle it does not keep repeating its 
behavior. It backs off and approaches from 
a different angle, and never does exactly 
the same thing again. It is apparent that 
variability of this type is necessary for the 
process of adjustment, since an animal 
which gave fixed invariable responses could 
never adapt itself to a variety of changing 
conditions. 


The problem solving model presented 
by the paramecium might well be envied 
by humans. There is ample experimental 
evidence that under some circumstances 
human problem solving is characterized 
by fixedness rather than variability of 
response. If the demands of the problem 


*This study is part of an Ed.D. thesis 
submitted to the Graduate School of Ed- 
ucation, Harvard University, by the first 
author. The research was performed when 
both authors were members of the Harvard 
Teacher Education Research Project, sup- 
ported by grants from the Fund for the 
Advancement of Education. 


situation are similar to previously ac- 
quired solution methods, the transfer ef- 
fects to novel situations should, of course, 
be positive. Where the demands of the 
problem are only superficially similar to 
previous successful solutions the present 
application of these methods is often 
doomed to failure. The next adaptive step 
would be to abandon the first method, 
or hypothesis, and to try another, in 
contrast to the persistent application of 
the first. 

Fixedness in problem solving has been 
variously attributed to immediately prior 
problem solving experiences, to the na- 
ture of the problem at hand, and to per- 
sonality dispositions of the solver (1, 4, 
5, 8). 

One major cause of lack of variability 
is problem solving set. Of course, should 
the person’s set be appropriate for the 
problem the solution is likely to be fa- 
cilitated. Under the not infrequent con- 
ditions where his set is inappropriate, the 
problem solver has a dual task before 
him, ridding himself of the maladaptive 
set and then applying the new solution. 
Recognizing Gibson’s (3) indictment 
against the chaotic usage of the concept 
of set, we will define it for our purposes 
as “that manner of attacking a problem 
which is carried over from a previous to 
a succeeding problem situation.” 

Set precludes the variability of be- 
havior which is essential if the habituated 
tack is to be cast off. This study is con- 
cerned with a way of training children 
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in problem solving so that in the face 
of new problem situations they will either 
not develop sets or will give them up 
when they do not work. 

What the problem solver needs is a 
nonspecific approach, (sometimes called 
also “mode of attack” or “plan of ac- 
tion”), to many types of problems. One 
important aspect of this general approach 
is the assumption that problems often 
lend themselves to various solutions, so 
that if one solution does not work there 
is still the possibility that the problem 
may be solved. The solver might profit- 
ably attack the problem again, with a 
new orientation. How may such general 
problem solving behavior be inculcated? 
Maier (7) gave Ss specific training in 
reasoning and Luchins (6) prior to his 
einstellung situation instructed his Ss, 
“Don’t be blind.” Both investigators re- 
port that these devices aided Ss in over- 
coming habituated ways of solving prob- 
lems. 

This study investigates another method 
of inducing variability in problem solv- 
ing. One group of Ss is taught two 
solutions to the same set of problems, 
another group one solution to these prob- 
lems. Then the two groups are observed on 
a set of similar problems necessitating new 
solution methods. Next, a set is induced 
on a novel series of problems and the 
responses to situations where this set no 
longer works are observed. Our expecta- 
tions are that the people trained in al- 
ternative solutions will solve more test 
problems correctly, will exhibit greater 
variability to these problems, and will 
persevere longer on problems too difficult 
for them to solve. 


METHOD 


The Ss were 48 sixth grade children, 
25 boys and 23 girls, who lived in a pre- 
dominantly middle class, white collar 
community. 

Ss were divided into two groups of 
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24 children each, with the sexes repre- 
sented as equally as possible in each 
group. The two groups did not differ 
either in intelligence or school achieve- 
ment. E took each child individually from 
his classroom to a nearby experimental 
room. £ told the child that he was in- 
terested in “how children solved prob- 
lems.” The procedure consisted mainly 
of training and test series on two separate 
types of paper and pencil problems. 

Problem 1. The water jar problems. 
These problems are an adaptation of the 
water jar problems described by Luchins 
(5). S is required to obtain a specified 
amount of water using only empty jars 
and their total capacities as measures. 
The jars were drawn on cards, with a 
number in each jar denoting its capacity 
and a number in the margin indicating 
the amount of water to be obtained. E 
presented a single problem on a card, 
and S solved the problem on a work 
sheet. 

The training series consisted of 10 
problems, each solvable by the formulas, 
B — A —C, or B — 2A + C (the letters 
refer to the capacities of the three jars). 
E taught S to solve the problems by 
one or both of these solution methods 
depending on the experimental design be- 
low. 

Following the training problems and a 
repetition of the general instructions, S 
was presented six test problems. Each 
of the first five called for a novel solu- 
tion method; the sixth was solvable by 
the method taught during the training 
series. 

S was permitted to work on each test 
problem until he solved it or until he 
decided to stop. The time spent on each 
problem was recorded. 

Problem 2: The puzzle problems. Im- 
mediately following the completion of the 
above training and test series the Ss were 
given 13 puzzle problems. These are paper 
and pencil versions of jigsaw puzzles. S 
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was given a booklet on each page of 
which was a picture of a whole figure. 
Beneath the figure were pictured num- 
bered fragments of various sizes and 
shapes. The problems call for S to choose 
those fragments which, when put to- 
gether, will form the whole figure. Ss 
indicated their solutions at the bottom 
of the page by marking the numbers of 
the pieces they chose, in the order in 
which they used them. 

The first eight of the 13 puzzles were 
all solvable by assembling the fragments 
numbered 1, 3, 5, and 7. Each of the 
five final problems required a different 
solution from the initial set-inducing se- 
ries and from each other. Superficially, 
the test problems looked as though they 
were solvable by the original method. To 
the S, there was no break between the 
training and the test problems. 

No time limit was set on these prob- 
lems; a record of the time spent on each 
problem was kept. 

Experimental manipulations. The two 
groups of Ss differed only in the treat- 
ment they received during the training 
series of 10 water jar problems. One 
group was taught the two alternative 
solutions to these problems (labelled A 
group). E worked the first problem by 
one and then the other solution method 
and half of the succeeding problems were 
worked out by one or the other method. 
The no alternatives group (hereafter la- 
belled NA) were taught to solve the 10 
problems by only one solution method. 


RESULTS 


Stated generally, our hypotheses are 
(a) that experience in solving problems 
in more than one way should dispose the 
S to relinquish more quickly a maladap- 
tive set and consequently yield more cor- 
rect solutions to new problems, and (6) 
that this variability of response should 


generalize to problems different from 
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those on which the dual solutions were 
taught. 

Consider first the six test problems in 
the water jar series: Group A solved 111 
problems correctly, whereas Group NA 
produced 94 correct solutions. Though the 
direction of difference is as perdicted, the 
difference is not statistically significant. 
More critical than the total number of 
correct solutions is the behavior on the 
first test problem. A correct answer to 
this problem indicates to the S that the 
solution methods of the training period 
may not be applicable to the ensuing 
problems and hence may aid him in over- 
coming whatever set was induced by the 
earlier series. Group A achieved 12 cor- 
rect answers; group NA, 9. Here too the 
direction is interesting, but the differences 
are not significant. 

The results are clearer when we com- 
pare the two groups in the number of 
other-than-training solutions they offered 
to the six test problems. Since there is 
abundant evidence that the sexes differ 
in their problem solving behavior, “sex” 
was added to “treatment” as a criterion of 
classification in all the analyses of variance 
reported” As can be seen in Tables 1 
and 2, Group A offered significantly more 
“other than training” solutions than did 
Group NA. We may infer then that train- 
ing in alternative solutions leads to more 
variable problem solving behavior, though 
it is not clear that the increased vari- 
ability yields more correct solutions. 

No time limit was set on any water 


_ jar problem. The instructions directed the 


Ss to work on a problem until either a 
satisfactory answer was achieved or until 
he thought that it was no longer profit- 
able to continue on that problem. The 
amount of time spent on a problem be- 
fore giving up was, then, a measure of 
perseverance. Table 3 gives the number 


*A correction factor for the unequal cell 
frequencies was applied to all of the an- 
alyses of variance. 
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TABLE 1 
MEAN NuMBER OF OTHER-THAN-TRAINING 
So.LutTions To Test WATER 
Jar PROBLEMS 








Group A Group NA 





7.15 
(13) 
7.91 
(11) 


Note.—Numbers in parentheses are the numbers of 
Ss in each cell. 


9.33 
(12) 
11.42 
(12) 


Male 
Female 





TABLE 2 
Summary oF ANALYSIS OF VARIANCE OF 
OrTHER-THAN-TRAINING SOLUTIONS 
To Test WaTER JAR PROBLEMS 








Sum of | Mean 


Source 





Sex 

Treatment 

Sex X Treatment 
Error 

















*P < 06. 


TABLE 3 


MEAN PERSISTENCE (IN SECONDS) ON TEST 
Water Jar ProsplLems 








Group A Group NA 





343.43 178.11 
(7) (9) 
131.41 101.84 
(10) (8) 
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of Ss in each sex and group who gave 
up the problems before actually achiev- 
ing a correct solution and the time in 
seconds spent on the problems before 
giving up. The analysis of variance sum- 
marized in Table 4 indicates that boys 
persevered longer than girls and that 
Group A persevered longer than Group 
NA. The greater perseverance of boys 
confirms other findings about sex differ- 
ences in perseverance (10, 11). The 
training in multiple possibilities of solv- 
ing a problem likely taught the NA group 
that failure with one method did not 
exhaust the possibilities of success, so 
that they may profitably spend addi- 
tional time on the problem. This finding 
complements Robinsons’ (9), who reports 
that Ss who thought their chances of 
arriving at a correct solution to a problem 
were excellent spent more time on that 
problem than did Ss who were less con- 
fident. 

The first part of the experiment con- 
firms several of our expectations. There 
was a tendency for the group trained in 
alternatives to solve more problems cor- 
rectly. This group offered a greater va- 
riety of solutions to the test problems, 
and on those problems they could not 
solve they worked longer before they 
gave up. 

The puzzle problems were introduced 
to test whether the effects of training in 
alternative solutions would transfer to a 


TABLE 4 


Summary oF ANALYSIS OF VARIANCE OF PERSISTENCE 
on Test Water JAR PROBLEMS 








Source df Sums of Squares Mean Square 





9495.53 
20777 .78 
4607 .02 
37, 705.80 


9495.53 
20777 .78 
4607 .02 
1256 . 86 


Sex 
Treatment 
Sex X Treatment 


1 
1 
1 
Error 30 





"P< Ol. 
P< Ol. 





TRAINING IN PROBLEM SOLVING 


novel set of problems. Each of the first 
eight puzzles were solvable by the use 
of, in each case, pieces numbered 1, 3, 5, 
7. The succeeding five puzzles required 
different pieces for their correct construc- 
tion. Does training in an earlier, different 
type of problem influence behavior on 
the new test problems? 

As with the water jar problems, the 
A group reached more correct solutions 
than did their controls. They solved 61 
problems correctly; Group NA solved 
49. On the first test problem, there were 
four correct solutions in the A group 
and none in the NA group. Though nei- 
ther of these differences is statistically 
significant, their direction was predicted. 

If the set induced by the initial eight 
jigsaw problems is effective, the Ss should 
continue to use the same numbered pieces 
in the test problems, where they are now 
inappropriate. Table 5 gives the mean 
numbers of pieces with the same numbers 
as those used in the training puzzles for 
each group. The analysis of variance for 
these data is summarized in Table 6. The 
A group evidenced greater variability by 
using fewer incorrect pieces than did 
the NA group. This finding permits us 
to conclude that the ability to overcome 
inappropriate problem solving sets is ac- 
quired through alternative training in so- 
lution methods to the same problems 
and that this ability is transferable to 
problem situations other than the one 
on which the alternative training was 
acquired. 


Discussion 


Some impressions about the water jar 
problems peripheral to our interests, but 
germane to Luchins’ (5) results, occurred 
to us during the experiment. Luchins, it 
will be recalled, creates a set to use com- 
binations of three water jars during his 
training problems and during the test 
series many Ss are not able to use simpler, 
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TABLE 5 


Mean NumBer or Same-AsS-TRAINING 
Pieces (PeRsisTeNceE) Usep IN 





2.33 
(12) 
1.83 
(12) 





TABLE 6 
Summary or ANALYSIS OF VARIANCE OF 
NuMBERS OF SAME-AS-TRAINING PIECES 
UseEp 1n SoL_vine Puzz_es 





| 
Source d is ty te P 








Sex 

Treatment 

Sex X Treatment 
Error 








"P< M1. 


two jar solutions. The interpretation is 
that set precludes Ss from seeing the 
easier solution method. In some instances 
in the present study it was not that the 
Ss did not see the simpler solution, but 
rather they did not understand that they 
were permitted to use two jars. Some of 
our Ss actually tried and abandoned cor- 
rect two jar solutions because, as they 
explained it, they thought that since the 
initial problems involved three jars, the 
continued use of all three was a part of 
the task. It may be therefore, that in 
the previous water jar studies, there were 
people who were actually aware of the 
simpler solution but were not clear that 
it was acceptable. : 
The pedagogical implications of our 
findings are apparent. Although it in- 
volves an extrapolation from laboratory 
to classroom conditions, we might expect 
that a teacher’s consistent training in 
alternative solutions to problems might 
result in efficient overcoming of set. 
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Put into historical perspective, the find- 
ings of this study are congruent with 
Woodworth’s general factors theory of 
transfer. Apparently, what occurs when 
the child is given even brief training in 
solving problems by more than one 
method is that he develops a “general 
approach to novel problems.” Colloqui- 
ally, this approach is self-instruction to 
abandon an inefficient solution method 
and to try something new. One wonders 
whether this general problem solving skill 
was verbalized by our experimental Ss 
or whether it operated as an unverbalized, 
functional concept. Although we have no 
data on this point, we might expect that 
the verbalization of the principle would 
increase its efficiency. 


SuMMARY 


Two groups of sixth grade children were 
trained to solve 10 water jar problems. 
One group was given a single solution 
to the problems; the second was taught 
that the problems were solvable by either 


of two methods. On a succeeding set of 
test problems which necessitated solutions 
different from the two training methods, 
the alternative group tended, though not 
significantly, to solve more problems cor- 
rectly, offered significantly more other- 
than-training solutions, and persevered 
longer on problems they were unable to 
solve. 

To test the transfer effects of the orig- 
inal training, Ss were given 13 jigsaw 
puzzles: the first eight were all solvable 
by a single method so that a set might 
occur; the final five required various so- 
lution methods. Here, also, the Ss who 
were trained in alternative methods on 
the earlier water jar problems evidenced 
greater problem solving variability by 
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using significantly more pieces of the 
puzzles which were not used in the train- 
ing series. 
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Reviews of teaching research have con- 
sistently concluded that different teaching 
procedures produce little or no difference 
in the amount of knowledge gained by the 
students (1, 2, 3, 5, 6). This same con- 
clusion has been reached despite the fact 
that experimenters have employed a wide 
variety of independent variables, such as 
lecture versus discussion classes, instruc- 
tor-centered versus _ student-centered 
classes, large versus small classes, various 
types of TV classes, ete. These results are 
surprising if one considers that much of 
the research was instigated by the hypoth- 
esis that differences would be found. Fur- 
thermore, it appears as if most educators 
still assume that classroom techniques do 
in fact have specific effects. Why then have 
differences not been found? 

One obvious hypothesis is that the teach- 
ing methods which have been employed 
are not sufficiently distinct to produce sig- 
nificant differences in the amount of knowl- 
edge acquired. The purpose of this paper 
is to examine an alternative hypothesis, 
namely, that the different teaching meth- 
ods have, in fact, produced differential 
amounts of learning but that these effects 
have been masked in the measurement 
process. 

Typically, in measuring the effectiveness 
of different teaching methods, one of the 
major dependent variables has been the 
performance of the students on the final 
examination. It is clear that variance on 
the final examination is due to many fac- 
tors in addition to the specific teaching 
methods employed. Such things as the in- 


*A portion of this paper was presented 
at the 1958 meeting of the APA. 

*Now at the University of California, 
Riverside. 


tellectual ability of the student, his moti- 
vations, the amount of studying he has 
done outside of class, various personality 
factors and environmental pressures, etc. 
will also affect his performance. In re- 
search on teaching methods, most of these 
other variables are not controlled (usu- 
ally, the major control is to equate stud- 
ents for ability) and, since they undoubt- 
edly account for a significant proportion of 
the variance, it is perhaps not surprising 
that the diverse experiments on teaching 
methods have so consistently concluded 
that there are no significant differences as 
a function of teaching method employed. 

A basic difficulty of this conclusion stems 
from the fact that students prepare for 
examinations by studying outside of class. 
It is perfectly possible that two groups 
taught under different procedures may 
learn very different amounts in class, but 
when both then engage in significant extra 
amounts of study, the difference in per- 
formance becomes negligible. This is es- 
pecially true since most final examinations 
are based, at least in part, on information 
to be found in the student’s textbook. 
Thus, if both groups study the textbook 
equally, one might expect that they will 
do about equally well on the examination 
even though one group has learned more 
in class. The matter can, of course, be more 
complex. It may be, for example, that if 
Group A learns more in class than Group 
B, then Group B feeling itself less well 
prepared for the pending examination will 
study more than will Group A. That is, 
one might find that students in preparing 
for an examination, study until they feel 
they know the material to a certain de- 
gree, and if one group learns the material 
in class they will feel less need for addi- 
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tional study than the group which has 
not learned the material in class. 

This discussion suggests that to evalu- 
ate a particular teaching method effec- 
tively, testing of the students ought to oc- 
cur immediately after the method has been 
employed thereby avoiding contamination 
by the additional variable of outside study. 
This procedure would be a more direct 
test of the influences of the teaching situ- 
ation per se and would therefore be more 
likely to reveal its effects, even if these 
effects are not observable on a final exam- 
ination. 

The following experiment was designed 
to test this general prediction. The inde- 
pendent variable of class size was em- 
ployed and it was hypothesized (a) that 
students in a small class would do better 
than students in a large class on quizzes 
for which they had not prepared, and (5) 
that no differences would be found be- 
tween the classes when students were given 
an opportunity to study for an examina- 
tion. The experiment was not specifically 
concerned with the variable of class size. 
Rather, it was used because it offered a 
convenient and simple way of testing the 
hypotheses and because it has so consist- 
ently been shown to have no influence on 
the amount learned by students when tra- 
ditional measurements have been em- 
ployed. In one review on the effects of class 
size, the author concludes that there have 
been “... more than 200 studies which 
clearly reveal that there are no consistent 
differences” (4). 


METHOD 


Subjects 


Two sections of. about 150 students each 
were enrolled in the senior author’s Gen- 
eral Psychology Gourse. The day after the 
students returned from their Christmas 
vacation, they were informed that for the 
last two weeks of the semester the instruc- 
tor intended to meet a small class which 
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would be identical in all r.spects to the 
larger classes except for size. The class 
was arranged for Monday, Wednesday, 
and Friday at 8:00 A.M. and 42 of the 
300 students volunteered to attend that 
class rather than their large one. These 
students were divided into two groups of 
21 students and each student in one group 
was very closely equated with a student 
in the other group on the basis of the three 
hourly examinations they had previously 
taken in the course that semester. By a 
flip of a coin, one group of 21 was then 
selected for the small class and the other 
group remained in one of the two large 
classes. Thus, the Ss for the experiment 
consisted of two equated groups of 21 
students each, one group in a small class 
of 21 students and the other group in 
large classes of about 140 students. 


Procedure 


Since the experiment was not concerned 
with testing the unique advantages of a 
small class versus a large class, the teach- 
ing method used in the different classes was 
as nearly identical as possible. The small 
class met on MWF at 8:00 A.M. and the 
two large classes met on MWF at 10:00 
and 11:00 A.M., respectively. It was ar- 
ranged that the small class meet earlier in 
the day than the large classes so that, in 
the event the students communicated in- 
formation to each other about examina- 
tions, the bias introduced would operate 
against the hypotheses. (Questioning of 
the students after examinations revealed 
that the amount of communication was 
just about nil.) 

All three sections were conducted by the 
lecture method and, as much as possible, 
the lecturer repeated exactly the same 
material to the three classes. During the 
last 10 minutes of the second and fourth 
lectures, the students in all three classes 
were given a “pop-quiz” which specifically 
covered the material that was presented in 
that lecture and the previous lecture. The 
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students had typically received a few 
weeks notice for examinations before this 
time and were clearly not expecting and 
had not prepared for these quizzes. 

In order to avoid biasing the lectures 
in favor of the quiz material, the quizzes 
were constructed by the junior author 
and were not seen by the lecturer until 
after they were administered. Both quizzes 
were multiple choice, the first containing 
15 questions and the second 8 questions. 
(The second quiz was originally designed 
as 15 questions but since only 8 of the 
items were actually covered in the lectures, 
the rest were discarded before the quizzes 
were scored.) 

The students took the final examina- 
tion about one week after the last day of 
classes. The final examination consisted of 
125 multiple choice items and was made 
up by several members of the department. 
It was administered at the same time to 
about 1,000 students, of whom 42 were 
the Ss for this experiment. Twenty-five 
of the items on the final examination cov- 
ered the same material which had been 
presented while the students were in the 
large versus small class experimental situ- 
ation. The two groups of 21 students were 
compared in their performance on the two 
quizzes and on these 25 items of the final 
examination. 


REsvULTs 


The two quizzes and the final examina- 
tion were scored by assigning one point 
for each correct answer. For each student, 
a total quiz score was obtained by adding 
the scores of his two quizzes, and differ- 
ence scores were then computed for each 
matched pair of students in the small and 
large classes. Because 13 of the 42 stud- 
ents were not present for one or the other 
of the two quizzes (seven students from 
the large classes and six students from the 
small class), the difference scores for these 
students were based on the one compara- 
ble quiz which both members of the 
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matched pair had taken. The difference 
scores revealed that 13 of the small class 
students did better than their matched 
pairs in the large classes whereas only 3 
did worse and there was no difference for 
5 students. The differences analyzed with 
a t test for matched pairs resulted in a t 
of 2.57, which for 20 degrees of freedom 
has a probability level of less than .02. 

An alternative method of analyzing the 
results was to correlate the two quizzes 
(the Pearson product-moment coefficient 
was .56) and to use the regression equa- 
tion to predict the scores on the quizzes 
missed by the absent students. This re- 
sulted in two quiz scores for each student 
which were added. The mean of this sum 
was 16.02 for the small class as compared 
to 14.34 for the matched students in the 
large classes. The standard error of the 
difference between the means was .56 
which resulted in a t of 3.00 which for 
seven degrees of freedom (since 13 of the 
seores were predicted) also has a prob- 
ability level of less than 02. 

On the 25 items of the final examina- 
tion, 10 of the small class students did 
better than their matched pairs in the 
large classes while nine did worse, and 
there was no difference for two students. 
The mean for the small class was 18.00 as 
compared to 17.53 for the matched stud- 
ents in the large classes. The standard 
error of the difference between the means 
was .90 which resulted in a t of 53 which 
is not significant. 


Discussion 


The results confirmed the hypotheses 
that differential performance would be 
found on quizzes which specifically cov- 
ered classroom material and for which the 
students had not prepared but that per- 
formance would be equal on final exam- 
inations for which the students had de- 
voted a large amount of extra study. One 
cannot be certain, of course, about the 
role attributed to extra study. The spe- 
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cific conclusion which can be drawn from 
the data is that in order to test the effect 
of a particular classroom technique, eval- 
uation should be done immediately after 
the technique is employed. Waiting until 
a final examination (or any other an- 
nounced examination) confounds the prob- 
lem by permitting many other variables to 
operate, one of the most obvious of which 
is extra study. 

In most experiments on learning, when 
one is measuring the effects of a particular 
variable, e.g., massed versus distributed 
practice, the learning opportunities of the 
Ss are limited almost exclusively to the 
experimental situation. This has not been 
the case in experiments on teaching meth- 
ods, perhaps because of the difficulties of 
controlling some of the extraneous vari- 
ables. It may be, however, that many of 
the experiments on teaching methods 
would have led to significant differences if 
the evaluation procedure avoided contam- 
ination by such factors as extra study. The 
fact that in previous studies, the variable 
of class size has so repeatedly been found 
to produce no significant effects on amount 
learned, and yet yielded significant differ- 
ences in the present experiment, implies 
that a restudy of other teaching variables 
using different measuring techniques might 
be more fruitful than it has previously 
been. 

Although the experiment was not pri- 
marily concerned with the effects of small 
versus large classes, the data indicate that 
students in the small class learned signifi- 
cantly more in class than did the students 
in a large class. The difference of about 1.7 
points on a test which had a mean score 
of about 15 was fairly large considering 
the relatively small variance of the differ- 
ence scores and the fact that the small 
versus large class variable was limited to 
only two lectures per quiz. Furthermore, 
there was no attempt to utilize the unique 
advantages of a small class by permitting 
more questions or discussions. 
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What was the specific variable operating 
in the large and small classes which pro- 
duced the difference? As Buxton (2) has 
pointed out, class size, like time, is not a 
variable by itself but rather is an abstrac- 
tion in which other variables may be per- 
mitted to operate. In evaluating the course 
at the end of the semester, almost all of 
the students in the small class spontane- 
ously commented that they found it easier 
to pay attention in the small class and 
that they had become more interested. 
Casual observations by the lecturer cor- 
roborated this. The students in the small 
class appeared to be much more alert in 
their listening as well as note-taking be- 
havior. Rarely, if ever, were they observed 
in behaviors which are not uncommon in 
very large classes, such as talking to each 
other, staring out the window, reading, etc. 
Undoubtedly, the very proximity of the 
lecturer in the small class acts as an in- 
hibitory influence on these behaviors. 

It is also possible, of course, that the 
variable of class size had nothing to do 
with the obtained differences. It may be 
that something like a “Hawthorne-effect” 
was operating in which the students in the 
small class felt more significant and more 
highly motivated because a special class 
had been established for them. This possi- 
bility, and similar ones, such as uninten- 
tional lecturer bias, or that students per- 
haps learn more at 8:00 a.m. than at 10:00 
or 11:00 am., do not reduce the signifi- 
cance of the major finding, namely that 
differences as a result of teaching tech- 
nique manipulations were found on “pop 
quizzes” but not on a prepared-for final 
examination. 


SuMMARY AND CONCLUSIONS 


Twenty-one students in a small class 
were compared on examination perform- 
ance with a matched group of students 
who were in a large class. It was hypothe- 
sized that the small class would do better 
on quizzes which specifically covered the 
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classroom material and for which the stud- 
ents had not prepared but that the two 
groups would do equally well on final ex- 
aminations for which they had studied. 
The hypothesis was confirmed and the im- 
plications of this methodological procedure 
were discussed in relation to other research 
- on the effectiveness of different teaching 
methods. 
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In a recent study (1) the results of a 
preliminary attempt to identify the di- 
mensions of job incentives among college 
students was reported. By means of a 
factor analysis of rankings of a homogene- 
ous list of job incentives by college Ss three 
dimensions were isolated and tentatively 
identified as: (a) need achievement vs. 
fear of failure; (5) interest in the job it- 
self vs. the job as arn opportunity for ac- 
quiring status; and (c) job autonomy of 
supervision vs. supervisor dependency. 
Simple procedures were developed for com- 
puting factor “scores” for individual Ss 
from the differences between pairs of 
ranked incentives. 

If the dimensions underlying the ranked 
incentives are related to choice of occu- 
pational goal among college Ss, we would 
expect that Ss divided into more homo- 
geneous subgroups on the basis of their 
curricular courses of study would show 
somewhat greater agreement in their rank- 
ing of the incentives than the combined 
curriculum heterogeneous total group of 
Ss and that the factor scores would be 
capable of demonstrating significant dif- 
ferences among the curriculum subgroups. 
Curriculum subgroup differences would 
have the added advantage of helping to 
define the meaning of the job incentive 
dimensions. 

The present research was designed to 
(a) identify, on an a priori basis, curricu- 
lar subgroups of Ss, (b) combine these 
subgroups into major curriculum group- 
ings by means of an empirical factor anal- 
ysis of their subgroup profiles in ranking 
job incentives, and (c) test whether there 
were significant differences among these 
major curriculum groupings on the de- 
rived factor scores. 


PROCEDURE 


A form was prepared which listed the 
eight incentives used previously (1) and 
it was administered to 267 undergraduate 
college Ss (174 men and 93 women) en- 
rolled in 10 sections of undergraduate psy- 
chology courses. The form requested the S 
to indicate his (her) name, age, sex, school, 
and curriculum group within the univer- 
sity, major subject and to write a brief de- 
scription of the job he expected to accept 
after graduation. The S was then requested 
to rank (from one to eight) the following 
list of incentives with the incentive that 
would be most important to him in se- 
lecting the job previously described being 
ranked “one” and the least important 
incentive being ranked “eight.” The in- 
centives used were: 

. Opportunity to learn new skills 
. Friendly fellow workers 
3. Freedom to assume responsibility 
. Good job security 
. Good prospects for advancement 
. Full insurance and retirement bene- 
fits 
. Recognition from 
initiative 

8. Good salary 

The 267 Ss who completed this form 
were dichotomized as to sex and further 
divided into eight major curriculum areas: 

Business Administration: 51 men. 

Engineering and Mines: 42 men. 

College B.A. (Humanities and Social 

Sciences): 35 men and 11 women. 

College B.S. (Natural Sciences) : 30 men 

and 11 women. 

Pre-Education: Elementary; 26 women. 

Pre-Education: Secondary; 16 men and 

25 women. 

Pre-Nursing: 11 women. 


supervisors for 
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Nursing Education: 9 women. 
This division by sex and curriculum re- 
sulted in 11 curriculum subgroups. 

Two sets of “scores” were available for 
each S. The eight rankings of the incen- 
tives constituted an incentive profile for 
the S and these rankings were averaged 
for each of the 11 subgroups. A second set 
of factor “scores” had been developed in 
the previous study (1). The Factor A 
score was defined as the ranking of Incen- 
tive 6 minus the ranking of Incentive 1. 
Similarly, the Factor B score was Incen- 
tive 8 minus Incentive 2 and Factor C 
was Incentive 7 minus Incentive 4. Posi- 
tive scores computed in this manner indi- 
cate (a) high need achievement, (b) high 
interest in the job itself, and (c) high need 
for autonomy from supervision. Negative 
factor scores presumably measure (a) high 
fear of failure, (b) strong attitude toward 
the job as a stepping-stone for advance- 
ment, and (c) high need for a dependency 
relation to the job supervisor. 
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REsvuULTs 


The average (mean) incentive profiles 
for each of the 11 curriculum subgroups 
can be found in Table 1. From the sums 
of ranks of the eight incentives for each 
subgroup the average rank-difference 
(rho) intercorrelation among the Ss con- 
stituting each subgroup was computed by 
the usual formula (2, p. 421) and these 
coefficients are given in the last column 
of Table 1. Nine of the 11 coefficients are 
significantly different from zero at the 01 
level of confidence with the remaining two 
coefficients being significant at the .05 
level. The average rho intercorrelation for 
the total group of 267 Ss (ignoring cur- 
riculum subgrouping) was .20 (significant 
at the .01 level). The median average inter- 
correlation of the 11 subgroups in Table 
1 is 28 which can be compared to the 
average intercorrelation of .20 when the 
curriculum subgroups are combined, indi- 
cating that the division of Ss into curricu- 
lum subgroups did somewhat increase the 
homogeneity with which the Ss ranked 
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the incentives. The median intercorrelation 
for the five male curriculum subgroups 
was .26, while the median coefficient for 
the six female subgroups was .32, sug- 
gesting that the female Ss were slightly 
more homogeneous in ranking the incen- 
tives than were the male Ss. However, 
the 11 coefficients were ranked and the 
Mann-Whitney U test (4, pp. 116-127) 
was used. The two-tailed probability was 
approximately .21 which indicates that 
the hypothesis of similar homogeneity 
among men and women Ss in ranking the 
incentives cannot be rejected. Two of the 
three smallest average intercorrelations 
were contributed by the college B.A. sub- 
groups and it is suggested that in future 
curriculum research the two major samples 
of Ss within these subgroups, majors in 
the humanities and in the social sciences, 
be treated separately in finding incentive 
profiles. The intercorrelation among the 
Pre-Education-Secondary men might be 
increased by splitting these Ss in future 
samples into those majoring in physical 
education and Ss majoring in more aca- 
demic high school subjects. 

In order to combine the 11 curriculum 
subgroups into more meaningful major 
groupings by means of factor analysis the 
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eight incentive mean ranks for each of 
the 11 curriculum subgroups were ranked 
and the intercorrelations among the 11 sets 
of ranks were computed by the rank- 
difference method. The mean correlation 
among the subgroups was .60 with indi- 
vidual coefficients ranging from .01 to .98. 
Nineteen of the 55 intercorrelations were 
significant at the .05 level of confidence. 
This matrix was factor analyzed by the 
usual centroid method and three orthogo- 
nal factors were extracted. The median 
absolute value of the residuals after ex- 
traction of the third factor was only .03, 
indicating the absence of a fourth factor. 
The original factors were rotated to simple 
structure and the rotated orthogonal fac- 
tor loadings for the curriculum subgroups 
can be found in Table 2. 

The 11 curriculum subgroups appeared 
to condense into six major groupings on 
the basis of their patterns of loadings on 
orthogonal Factors X, Y, and Z, and their 
factor pattern groups (I to VI) are indi- 
cated in Table 2. Factor X, which ac- 
counts for 40 per cent of the variance 
among the subgroups, is obviously a sex 
factor with the male subgroups all having 
high loadings on this factor and the female 
subgroups showing moderate or low load- 


TABLE 2 
Factor ANALYSIS OF RANK INTERCORRELATIONS AMONG CURRICULUM SUBGROUPS 








Factor 


Pattern Curriculum Subgroups 





College B.A. M 
Business Administration M 
Engineering & Mines M 
Pre-Educ: Secondary M 
College B.S. i M 
Nursing Education 
College B.S. 
Pre-Educ: Secondary 
Pre-Educ: Elementary 
Pre-Nursing 

College B.A. 


Pe 











Sex 
Group 


Fact di 
ae actor Loadings 


Students 


35 
51 
42 
16 
30 

9 
11 
25 
26 
11 
11 


SHeasssexes | x 























CURRICULAR DIFFERENCES IN JOB INCENTIVES 


ings. Factor pattern Group VI is similar 
to Group V except for an extremely low 
loading on the “masculinity” Factor X 
which appears to justify the separation of 
Group VI (College B.A. women) from the 
three female subgroups comprising Group 
V. Comparing the men and women Ss on 
the mean ranks they assigned to the indi- 
vidual incentives indicated that the men 
preferred Incentives 5 (Good prospects 
for advancement) and 8 (Good salary), 
while the women reported that Incentives 
1 (Opportunity to learn new skills) and 
3 (Freedom to assume responsibility) 
would be more important in their job 
selection. 

Factor Y (37 per cent of the intersub- 
group variance) appeared to be almost 
the reverse of the “masculinity” Factor 
X with the exception that the male sub- 
groups within Group III (Pre-Education: 
Secondary and College B.S.) had moderate 
loadings on Factor Y (median loading = 
AS) that were intermediate in size be- 
tween the loadings of the six female sub- 


groups (median loading = 81) and the 
remaining three male subgroups (median 
loading = .09). The factor pattern groups 
were trichotomized as to their loadings on 
Factor Y with Groups V and VI being 
high (median = .88), Groups IIT and IV 


being intermediate (median = 49) and 
Groups I and II having low loadings 
(median = .09). Incentives 3, 5, and 8 
appeared to be related to Factor Y with 
subgroup preferences for Incentive 3 be- 
ing positively correlated with Factor Y 
loadings and Incentive 5 and 8 preferences 
being negatively related to the loadings. 
Factor Y might be labelled a “tender- 
minded social service” vs. a “toughminded 
practical” dimension from an inspection 
of the curriculum subgroup loadings along 
this factor. However, “naming” of this 
factor is less important than is the use 
of the factor loadings in empirically 
clustering the subgroups. 
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Factor Z (12 per cent of intersubgroup 
variance) was a triplet factor with the 
three subgroups included in Factor Pattern 
Groups II and IV showing large loadings 
and the remaining eight curriculum sub- 
groups showing zero loadings. Groups II 
and IV seem to prefer Incentive 1 more 
than did the remaining groups, but com- 
parisons of the high and low Factor Z 
groups showed slight differences on their 
mean rankings of the other incentives. 
We might suggest that Factor Z repre- 
sents an “interest in science and technol- 
ogy” factor, but again it should be empha- 
sized that identification of this factor by a 
name was not an aim of this study. 

Three factor scores had been computed 
for each of the 267 Ss in the 11 curriculum 
subgroups and three analyses of variance, 
one for each factor score, were computed 
to test for significant differences among 
the subgroups. The total variance (df = 
266) of a factor score was first split into 
the variance between the means of the 
11 subgroups (df = 10) and the residual 
individual differences variability among the 
Ss within the subgroups (df = 256). The 
11 within subgroup variances were tested 
for homogeneity by Bartlett’s chi-square 
technique before pooling and the chi- 
square values gave no evidence of signifi- 
cant heterogeneity for any of the three 
factor scores. The between subgroups vari- 
ance was further divided into two compo- 
nents: between factor pattern groups 
(df = 5) and the residual variability 
among the means of the subgroups within 
the factor pattern groups (df = 5). The 
summaries of these analyses of variance 
can be found in Table 3. All three factor 
scores discriminated among the subgroups 
at the .01 level of confidence. The differ- 
ences among the factor pattern groups 
were significant at the .01 level for all 
three scores, while the residual variation 
among the subgroups comprising the pat- 
tern groups did not approach statistical 
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TABLE 3 


ANALYSES OF VARIANCE oF CuRRICULUM SuBGROUP 
DIFFERENCES IN Factor Scores 
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significance. Thus all of the variation in 
mean factor scores could be accounted for 
by the grouping of the curriculum sub- 
groups accomplished through the factor 
analysis of subgroup profiles reported 
above. 

The intercorrelations among the factor 
scores were computed for each of the 11 
subgroups and averaged (weighted mean). 
The average intercorrelations (df = 245) 
among the factor scores were: A and B, 
r = .16 (significant at the .05 level); A 
and C, r = —.24 (significant at the 01 
level) ; and B and C, r = —.05 (not sig- 
nificant). Although two of the three fac- 
tor score intercorrelations are statistically 
significant for this large sample of Ss, all 
three are low enough to indicate that the 
scores are relatively independent measures 
of job incentive dimensions. 

The mean factor scores for each of the 
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pattern groups were computed and are 
given in Table 4. For Factor A, Group 
IV appears to be quite high in “need 
achievement” while Groups I and III 
show a high degree of “fear of failure.” 
Groups V and VI have mean factor scores 
(Factor B) indicating strong interest in 
the job, while the other groups, particu- 
larly Group I, view the job as a means of 
advancing their own status and position. 
The average Factor C scores show Groups 
III and V to prefer jobs that are free of 
close and immediate contact with super- 
vision and Group VI Ss express a need 
for a strong and intimate dependency re- 
lationship with their supervisors. 


Discussion 


From one point of view the present 
study had two questions: (a) whether the 
somewhat crude factor “scores” devel- 
oped in the previous study (1) were valid 
in discriminating differences between sub- 
groups of Ss who had chosen different 
college curricula, and (b) whether a fac- 
tor analytic grouping of the curriculum 
subgroups would account for a sizable 
amount of the intersubgroup variability 
in factor scores and provide information 
about the dimensions of job incentives 
for college Ss. Table 1 indicates that sev- 
eral mistakes were originally made in de- 
fining the (assumed) homogeneous curricu- 
lum subgroups. Both the men and women 
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CURRICULAR DIFFERENCES IN JOB INCENTIVES 


college B.A. subgroups should have been 
further split to achieve more homogeneity ; 
probably by separating Ss majoring in the 
Humanities from those majoring in the 
Social Sciences. Similarly, the men in the 
Pre-Education subgroup should be divided 
into more similar subgroups. However, 
these errors in the a priori subgrouping 
of the Ss could not be corrected after the 
Table 1 results were known without leav- 
ing the study open to the charge of data 
manipulation. These subgrouping errors 
appear to have had little effect on the 
answers to the original two questions. 

The 11 subgroups condensed nicely into 
six broader groupings on the basis of the 
factor analysis reported in Table 2. Al- 
though we labelled the three obtained 
Factors X, Y, and Z, as “masculinity,” 
“tenderminded social service vs. tough- 
minded practicality,” and “interest in 
science and technology,” it must again be 
emphasized that we are not convinced that 
there are adequate appellations and that 
“naming” of the factors was unimportant 
for our purpose. We were interested solely 
in the use of factor analysis here to allow 
us to combine curriculum subgroups into 
a more parsimonious set of major group- 
ings. 

The answers to the two major research 
questions are found in Table 3. The fac- 
tor scores showed differences among the 
11 curriculum subgroups that were highly 
significant. Parenthetically it might be 
noted that the data given in Table 3 al- 
lows the computation of epsilon correla- 
tion coefficients to provide rough indices 
of the “validity” with which the Factor 
A, B, and C scores discriminated differ- 
ences among the subgroups. Epsilon is a 
curvilinear correlation method similar to 
the eta correlation coefficient included in 
most statistics texts and is based upon 
the ratio of the variance of the means of 
the subgroups to the variance within the 
subgroups (3, pp. 319-324). These validity 
coefficients were .31, 35, and .27. The 
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average intercorrelations among the factor 
scores were also low enough to estimate 
that each score provided some independent 
information about subgroup differences. 
Table 3 also shows that practically all 
of the subgroup differences in scores were 
accounted for by the differences among 
the factor pattern groups. Another way of 
saying this is that the curriculum sub- 
groups included within the six larger 
groupings were quite homogeneous in fac- 
tor scores. Curriculum differences account 
for approximately 10, 12, and 7 per cent 
of the inter-subject variability in factor 
scores with the six factor pattern groups 
accounting for practically all of this cur- 
riculum-related variance. 

It seems fair to conclude from this pre- 
liminary study that (a) the method of 
factor analysis provides a technique by 
which the dimensions of job incentives 
can be isolated and relatively independent | 
factor scores can be estimated for indi- 
vidual Ss, and (6) these factor scores are 
significantly related to college Ss’ choices 
of academic curricula. However, further 
expansion and delineation of the factors 
underlying job incentives is needed with 
a consequent development of additional 
factor scores that can be estimated more 
reliably. 


SuMMARY 


A list of eight job incentives was ranked 
by 267 college Ss and the Ss were divided 
into 11 subgroups on the basis of sex and 
college curriculum variables. A factor anal- 
ysis of the subgroup incentive profiles iso- 
lated three factors (tentatively labelled 
“masculinity,” “tenderminded social vs. 
toughminded practicality,” and “interest 
in science and technology”) and condensed 
the 11 subgroups into six major curricu- 
lum groupings on the basis of the subgroup 
patterns of factor loadings. Scores on three 
factors isolated in a previous factor analy- 
sis of the incentives were computed for 
individual Ss and significant differences 
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(.01 level) were found among the six 
factor pattern groups for all three factor 
scores. 
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RELATIONSHIP OF INTELLIGENCE AND SOCIAL POWER TO 
THE INTERPERSONAL BEHAVIOR OF CHILDREN? 
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There are contradictory beliefs about 
the behavior of highiy intelligent children 
in school, and particularly about their 
participation in problem-solving discus- 
sions. These children are described as 
both influential and impotent, tolerant 
and impatient, supporting and rejecting, 
eager and bored. Little information exists 
which can help us to separate fact from 
fancy among these assertions, nor do we 
know much more about the ways in which 
intelligent persons differ from less intelli- 
gent ones in these respects. 

There is a reason for this lack of in- 
formation. Intelligence as a concept is 
primarily intended to describe an ability 
to deal with cognitive problems. There 
are few elements in definitions of intelli- 
gence which suggest that variations in in- 
tellectual ability are associated with varia- 
tions in face to face behavior. Hence, 
intelligence has seldom been used as an 
independent variable in studies of social 
behavior. 

In a decision making group, however, 
a person with high intelligence may per- 
haps offer wiser observations than one with 
low intelligence. Because of his greater 
usefulness, we can anticipate that a 
brighter child would be more influential 
than a less intelligent one. Because he is 
more expert, he should have greater so- 
cial power, that is, a greater ability to in- 
fluence others. On the basis of assumptions 
like these, it is apparent that intelligence 
can be a cause for particular types of inter- 
personal behavior. Some children, regard- 


*The research reported herein was per- 
formed pursuant to a contract with the 
United States Office of Education, Depart- 
ment of Health, Education and Welfare. 


less of their degree of intelligence are able 
to exercise strong influence on their peers, 
while others are consistently ignored by 
classmates. 

How then does a person’s intelligence 
affect the way he acts toward others when 
his group must reach a decision? Does he 
behave differently when he is used to hav- 
ing his ideas accepted (has high social 
power) than when he is used to being ig- 
nored? These are primary interests in this 
study. 

Boys and girls are expected and re- 
quired to behave differently in social set- 
tings. Boys, for example, are more often 
pressed than are girls to be concerned 
with achievement and influence. Because 
prescriptions for the two sexes differ, it is 
probable that the meaning and effects of 
intelligence or social power differ for boys 
and girls. A secondary purpose of this 
study is to examine the impact of intelli- 
gence and social power on the interper- 
sonal behavior of boys compared to girls. 

Toward these purposes, measures were 
made of the intelligence and social power 
of all children in a number of classrooms. 
These persons were then put in standard- 
ized, small, problem-solving groups. Their 
participation was observed in terms of pre- 
coded categories to see how those with dif- 
ferent degrees of power and intelligence 
differed in their behavior. Data were also 
obtained from teachers and classmates 
concerning characteristics of these children 
in regular classrooms. 


Major Issues 


In the small discussion groups, we as- 
sumed children would behave in the ways 
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they typically do in their classes to try to 
win acceptance for their ideas. 

It was expected that intelligence should 
make a difference in the actions initiated 
by them, as already mentioned, because 
intelligent persons have more resources to 
offer in a problem-solving discussion than 
do less intelligent ones. Also, they might 
have more confidence in their own pro- 
posals, stemming from the ready accept- 
ance of them in the past. Highly intelli- 
gent pupils, therefore, were expected to 
make more efforts to influence others, to 
have their ideas accepted more often than 
less intelligent pupils, and to behave in 
ways which could be taken as typical! of 
persons with confidence in themselves. 


Children with greater social power were . 


expected to make more efforts to influence 
others, to be more successful in doing so, 
and to behave in ways which have been 
shown in other studies to be typical of 
persons with greater power (2, 3, 4, 5, 
6). 

How do teachers characterize children 
who have different degrees of intelligence 
and social power? To determine this, 
teachers were asked to rate the behavior 
of Ss in categories roughly similar to those 
used in observing the behavior of the 
children in the small groups. What as- 
pects of the teachers’ opinions, based on 
day by day experience with these children, 
support or contradict the behavior shown 
in the standardized group situations? Were 
teachers able to differentiate between the 
behavior of one type of person and that 
of another? 

Finally, it is useful to know how class- 
mates characterized the children to whom 
they ascribed high social power as com- 
pared to those to whom they attributed 
little power. The Ss were rated by their 
peers concerning their ability in school- 
work, their attractiveness, and their abil- 
ity to coerce or threaten others. These 
personal qualities were considered to be 
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separate sources of social power as sug- 
gested by French and Raven (2). It was 
anticipated that persons with greater so-— 
cial power, in contrast to those with less, 
would have these qualities ascribed to 
them more often by classmates. Do highly 
intelligent persons differ from the less in- 
telligent in these respects? 


METHOD 


Data used in this research were origi- 
nally collected by colleagues for a differ- 
ent but related purpose.’ In the original 
investigation, measures were obtained con- 
cerning all children in 16 second grade and 
16 fifth grade classrooms, representing all 
socioeconomic levels in a medium-sized 
city. Children were selected in that study 
for a field experiment concerned with the 
creation of changes in the social position 
and behavior of group members. The 
measures were made in order to establish 
a baseline, so that the amount of change 
in a participant’s behavior could be de- 
termined. The data employed in the pres- 
ent investigation are from these pre-ex- 
perimental measures. 

From the original population of 638 
children on whom measurements were 
available, Ss were chosen for this study 
who had degrees of intelligence and power 
in required combinations. Pupils desig- 
nated as high in intelligence were those in 
the upper 33 per cent of their class and 
those designated as low in intelligence 
were in the lower 33 per cent. Children 
designated as high in power were in the 
upper 50 per cent of their class and those 
designated as low in power were in the 
lower 50 per cent of their class on this 
measure. The sample included 226 boys 


* The collection of the original data was 
supported by Grant-in-Aid (M-919) from 
the National Institute of Mental Health, 
National Institutes of Health, US Public 
Health Service. Ronald Lippitt was project 
director. We are grateful for his kindness in 
furnishing the data for our use. 
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and 192 girls, 230 second grade children 
and 188 fifth graders. 

The type of data available for each 
child and the source of each are as fol- 
lows: 

Intelligence. Intelligence scores were ob- 
tained from school records. They were based 
on. the results of the Kuhlman-Anderson 
test, administered in a group form. 

Ratings by classmates. Every class mem- 
ber rated every child in his class on four 
characteristics: social power (who can most 
often get you to do things for him?), at- 
tractiveness, ability in school work, and 
ability to threaten others. To obtain these 
ratings, photographs of every child in the 
classroom were printed on sheets with a 
four-point rating scale next to each picture. 

Observed behavior in groups. The mem- 
bers of a class were divided into four 
smaller groups on a random basis and each 
group was sent to a corner of the room 
where they worked on assigned problems. A 
trained observer was stationed in each cor- 
ner who recorded on a precoded observation 
schedule the quantity and quality of behav- 
ior initiated by each child* Four problems 
were assigned to the groups, each of which 
required a group decision as a first step in 
progress toward completion of the task. In 
one problem, for example, the group built 
a large tinker-toy and in another arrived at 
a decision as to how many beans were in a 
bottle. At the end of each task new groups 
were formed for the next problem, thus pro- 
viding maximum opportunity for each child 
to interact with every other child in the 
room. No chairmen were designated for these 
discussions. 

The observed behavior was coded into the 
following categories: 

1. Influence attempts—efforts to influence 
others regardless of the manner employed. 

2. Successful influence attempts—efforts 
in which compliance was obtained from 
others. 

3. Unsuccessful influence attempts—efforts 
in which compliance was not obtained from 
others. 

4. Demanding influence attempts—com- 


* The observers had satisfactory reliability 
among them. Information concerning reli- 
ability of the observations will be available 
in publications concerning the project from 
which these data were borrowed. 
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ments made in an ordering or directing man- 
ner, implying autonomy for the actor. 

5. Suggestions—comments indicating weak 
proposals to, or requests of, others. 

6. Valuing of others—behavior indicating 
recognition of either high or low resource- 
fulness of another person in an area of 
knowledge or skill. 

7. Positively valuing others’ behavior— 
comments indicating recognition of high 
value in others’ behavior. 

8. Negatively valuing others’ behavior— 
comments indicating recognition of low value 
in others’ behavior. 

9. Affect-laden behavior—behavior in 
which overt friendliness or unfnendliness is 
observed which is not a direct attempt to in- 
fluence another. 

10. Aggressive behavior—acts of aggres- 
sion which are either inflicted or threatened. 

11. Mean weighted directness in style— 
ratio of frequency of forceful to nonforceful 
forms of behavior toward others. 

Perceptions by teachers. Every child was 
rated by his teacher on seven characteristics 
descriptive of the typical social behavior of 
the person in his schoolroom. The teacher 
was asked to indicate on a five-point scale 
the extent to which each child showed the 
behavior under consideration (from “hardly 
ever” to “most often”). The qualities rated 
are shown in Table 6. 

Teachers did not know how much social 
power classmates had attributed to the Ss 
at the time they made their own ratings. 
Information concerning the intelligence of 
these children was available to teachers in 
school records. 


REsULTs 


Characteristics attributed to the Ss by 
their peers, for boys and girls with differ- 
ent degrees of power, are first discussed. 
Results are then presented for (a) the ob- 
served behavior of Ss in the small groups 
and (b) the perceptions of Ss’ daily be- 
havior by teachers. Finally, the results 
are summarized and interpreted. 

For the sake of brevity the data in 
tables are usually limited to statistically 
significant findings. The omission of re- 
sults for a specific category of behavior 
indicates that no significant differences 
were obtained for it. 

At the outset it should be noted that 





260 


social power is not highly correlated with 
intelligence. The correlation for boys was 
.20 and for girls .28. The low relationships 
between intelligence and power indicates 
that they may vary quite independently. 
Since both of these correlations are sig- 
nificant at the .01 level of probability, 
however, it is also clear that brighter 
children tend to have more power than 
less intelligent ones. 


Characteristics Attributed to Subjects by 
Classmates 


Both boys and girls who were attributed 
high social power were more attractive to 
classmates than those low in power, re- 
gardless of their intelligence. These results 
may be seen in Table 1. Girls with greater 
power were rated as more able in school- 
work than girls low in power irrespective 
of their intelligence, while boys were de- 
scribed as more able in schoolwork to a 
significant degree only among those high 
in power and intelligence. Boys with higher 
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power were described as more threatening. 
This was not true for girls. 

Boys and girls with higher intelligence 
were seen by classmates as significantly 
more able in schoolwork than less intelli- 
gent persons (M 2.24 and 1.76, p of diff. = 
01). Girls high in intelligence were rated 
as more attractive than those low in intelli- 
gence (M 2.40 and 2.01, p of diff. = .005). 
Ability to threaten was not related signifi- 
cantly to intelligence. 


Benavior or Boys in SMALL Groups 


We consider the types of behavior boys 
used in the problem solving discussions. 


Effects of Variations in Power 


Among highly intelligent boys, those 
high in social power were not significantly 
different in any category of observed be- 
havior from those low in power. 

Among less intelligent boys, those high 
in power, compared to those low in power, 
revealed a vigorous, inconsistent, and com- 


TABLE 1 


MEANS OF CHARACTERISTICS ATTRIBUTED BY CLASSMATES TO 
CHILDREN WITH VARIED SociaL PowER 











Boys 





Children high in 


intelligence Social Power 





High M | Low M 


Social Power 








High M 





2.13 
2.19 
3.02 


Attractive 
Able in school work 
Threatening 


N 58 








2.09 1.50 
2.14 1.58 
2.89 2.86 





73 











Children low in intelligence 





Attractive 
Able in school work 
Threatening 


N 
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petitive pattern of behavior. The quanti- 
tative results may be seen in Table 2. 


Effects of Variations in Intelligence 


Among boys with high power, the be- 
havior of the more intelligent was in no 
way significantly different from that shown 
by the less intelligent. 

Among boys with low power, those with 
greater intelligence, compared to those 
with less, were active, effective, and sup- 
portive of others. These data are sum- 
marized in Table 3. 


Beuavior or GIRLS IN SMALL Groups 


Effects of Variations in Power 

Among girls with high intelligence, those 
with high power were not observed to be- 
have differently from those with low 
power. 

Among girls who were less intelligent, 
those with high social power were only a 
little different from ones low in power. 
Those with more power were more often 
successful in their influence attempts (M 
8.81 and 5.61, p of diff. = 01) and were 
more positive in their remarks about 
others’ behavior (M 1.28 and 58, p of 
diff. = 01). 


Effects of Variations in Intelligence 


Regardless of their social power, varia- 
tions in intelligence were not associated 
to a significant degree with differences in 
girls’ observed behavior. 


Benavior or Boys anp GIRLs 
DirectLy CoMPARED 


A direct comparison of the behavior 
used by boys and girls, regardless of their 
power or intelligence indicates the actions 
most typical of each sex. The following 
behaviors were observed among boys sig- 
nificantly more often than among girls: 
attempts to influence, successful influ- 
ences, unsuccessful influences, aggression, 
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TABLE 2 


Bexavior or Less INTELLIGENT Boys 
VARYING IN SocraL Power 








Social Power 


| Low M 


16.00 





Total influence at- 3.18** 
tempts 
Freq. successful in- 
fluence attempts! 
Freq. unsuccessful! 10.48 
influence at- 
tempts 
Freq. <a 5.33 


13.87 | 7.96 | 3.16°* 


8.04 | 1.98* 


2.10° 
influence at- 
tempts 
Freq. suggestions | 18.90 
Total valuing of | 2.48 
others’ behavior} 
| 


3.13** 
2.03* 








N 46 





TABLE 3 
Grovup Benavior or Bors wita Low 
Soci1aL Power VARYING 
IN INTELLIGENCE 








| Intelligence 


| = 
High M| Low M 





Total influence at-| 21.63 | 16.00 
tempts 

Freq. successful in- 
fluence attempts 

Freq. suggestions 

Freq. positively 
valuing others’ 
behavior 


11.97 | 7.96 


16.86 | 12.30 
1.37 89 


N 














and demands. Girls did not display any 
type of behavior significantly more often 
than boys. It is evident that boys were 
considerably more active and demanding 
in their groups than were girls. 





262 


ALVIN ZANDER AND ELMER VAN EGMOND 


TABLE 4 
Benavior oF HiGHiy INTELLIGENT Boys Versus Girts VARYING IN SociAL Power 


Social Power 


Low 
Girls 


Low | 
Boys 
M 





21.63 
11.97 


Total influence attempts 

Freq. successful influ- 
ence attempts 

Freq. unsuccessful in- 
fluence attempts 

Freq. suggestions 

Total valuing of others’ 
behavior 

Freq. demanding influ- 
ence attempts 

Freq. aggressive behav- 
ior 

Total affect-laden re- 
marks 

M weighted demanding- 
ness in behav. 


4 











*p= 05. 
“p= Ol. 
*** p = 001. 


When members of both sexes were high 


in intelligence but low in power, boys were 
more active and aggressive than were girls. 
These data are shown in Table 4. 

Where members of both sexes were 
high in both intelligence and power, the 
patterns of behavior were similar to those 
just described. In addition, high-high boys 
were demanding in their comments and 
used more affect-laden types of behavior 
than high-high girls (see Table 4). 

In contrast to the girls, then, highly 
intelligent boys were likely to be active 
in their groups regardless of their social 
power and likely to be aggressively in- 
sistent in stating their opinions when high 
in both power and intelligence. 

Among the less intelligent children the 
boys again appeared to be more involved 
in their groups than the girls if they were 
high in power. A comparison of the be- 


1 

Social Power 
' 

|—__— 

| 


| High 
| Girls 


M 


| 16.69 
9.79 


2.30* 
1.80* 


2.39%* | 9. 6.90 
| 


7... 
.97 


13.80 


.28 : 2.61 
2.36°* 
3.92*** 


50 


havior of boys and girls is presented in 
Table 5. Boys with low intelligence and 
low power were very little different from 
girls with low intelligence and power. 


Teacuers’ Perceptions or Boys 


We turn to an examination of the 
qualities teachers attributed to these chil- 
dren. 


Effects of Variations in Power 


Among highly intelligent boys, the 
teachers made clear distinctions on every 
category between boys who were high in 
power and those who were low in power. 
These results are shown in Table 6. 

Among less intelligent boys, the teachers 
made similar distinctions in the charac- 
teristics they attributed to boys high in 
power and those low in power. These re- 
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TABLE 5 
Benavior or Less InreLLiGent Boys Versus Giris as RELATED To Social Power 


Social Power 


High 
Boys 


High 
Girls 


M 


Total influence attempts) 24.3: 15.96 

Freq. successful influ- 3.87 8.81 
ence attempts 

Freq. unsuccessful influ 8 15 
ence attempts 

Freq. suggestions 

Freq. positively valuing .61 
others’ behavior 

Total valuing others’ 2.48 54 
behavior 

M. weighted demand- 
ingness 

Freq. aggressive  be- 
havior 


*p= 05 
*p= Ol 
p> = Ol 


sults are listed in Table 6. In the eyes of 
teachers, boys with greater power are 
strikingly different in their social behavior 
from those with less power. 


Effects of Variations in Intelligence 


Teachers made no distinctions to a sig- 
nificant degree between boys high in in- 
telligence and those low in intelligence, re- 
gardless of their social power. 


Teacuers’ Perceptions or GIRLS 


Effects of Variation in Power 


Among girls high in intelligence, those 
with high social power were seen to be 
different from girls low in power as shown 
in Table 7. 

Among girls low in intelligence those 
with high social power were perceived by 
teachers to be different from those with 
low social power in only two respects. 


Social Power 


Low Low 
Boys Girls 
M M 


16.00 13.49 
7.96 5.61 


8.04 7.88 


12.31 9.75 
89 


1.76 
21.95 


3.13 


Girls with higher power were seen as more 
often successful in influencing others (M 
4.82 and 3.77, p of diff. = 005) and more 
friendly (M 4.30 and 3.46, p of diff. = 
005) . 


Effects of Variations in Intelligence 


Among girls with greater social power, 
teachers saw girls with high intelligence 
as being little different in their classrooms 
from those with low intelligence: the girls 
were viewed as making more attempts 
to influence classmates (M 4.26 and 3.80, 
p of diff. = .02) and as more successful 
in these attempts (M 3.77 and 3.19, p of 
diff. = .005). 

Among girls with little social power, 
highly intelligent girls compared to less 
intelligent ones were seen by teachers as 
making more frequent attempts to exercise 
influence in the class (M 4.29 and 3.76, p 
of diff. = .02), as more successful in these 
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TABLE 6 
CHARACTERISTICS TEACHERS ATTRIBUTE TO Boys WITH 





Boys high in intelligence 


DIFFERENT DEGREES oF SociaL PowER 


Boys low in intelligence 





Social Power 





High | Low 
M | 


4.05 | 
4.10 





Freq. influence attempts! 

Amt. success as influ- 
encer 

Friendliness 

Depends on teacher 

Depends on peers 

Self-centeredness 

Degree of forcefulness 


N 











*p= 06. 
p= Ol. 
** p = 001. 


efforts (M 4.82 and 3.78, p of diff. = .005), 
and as more friendly (M 4.29 and 3.62, p 
of diff. = .005). 


TeacHers’ Perceprions or Boys 
AND GirRLs COMPARED 


The perceptions teachers had of boys 
and girls differed to a significant degree 
only where the merabers of both sexes 
were high in power while low in intelli- 
gence. Here, boys were seen as more active 
in making attempts to influence class- 
mates and more self-centered in doing so 
than were girls. 


DiFFERENCES Due To AGES 
or SuBJECTS 


The results on all dimensions for second 
graders were compared with those of fifth 
graders. There was no acceptable evidence 
that differences in ratings of Ss by peers 
or teachers, or in their observed behavior 
in. the small groups, were associated with 
differences in age. 


Social Power | 











TABLE 7 
CHARACTERISTICS TEACHERS ATTRIBUTE TO 
Higuiy INTELLIGENT GIRLS WITH 
DirFeRENT DEGREES oF SociaL 
PowER 


Social Power 
| | 
i— 
| High | 
| M | M | 


Low | 


| | 


Amt. success as in- | 3.78 | 3.19 | 3.39°* 
fluencer 

Friendliness 

Depends on teacher 

Self-centeredness 


N 





*p= 06. 
"p= Ol. 


SumMary AND Discussion or Resutts 


Characteristics Attributed by Peers 


Boys and girls were similar in that those 
with greater power, compared to those 
with less, were better liked by their peers. 
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Attractiveness then was an important 
basis of power for both sexes. 

Boys and girls were different in that 
boys with greater power were seen by 
classmates as more threatening, and girls 
with greater power were rated as more 
expert “in the things you do in school.” 

These results suggest that among these 
children the two sexes won social power 
in part on the basis of different at- 
tributes, boys earned it by being threat- 
ening and girls earned it by being skilled 
in the things required of a school child. 


Observed Behavior in Groups 


The behavior of boys may be briefly 
summarized by noting that those who were 
low in social power and low in intelligence 
(low-lows) were strikingly different from 
boys with all other combinations of in- 
telligence and power. The significant dif- 
ferences we have seen in the behaviors of 
boys stem primarily from the fact that the 
low-lows were passive persons in their 
groups, more than boys who were high in 
either power or intelligence. The low-lows 
significantly less often tried to influence 
others, were less successful in doing so, 
were less demanding in manner, less often 
evaluating in respect to others’ contribu- 
tions; and less often suggestors of tentative 
proposals. Boys with various combinations 
of power and intelligence other than low- 
low were in no way different from one an- 
other to a statistically significant degree 
in their observed behavior, that is, high- 
highs were not different from high-lows 
or from low-highs. It is worth special note 
that highly intelligent boys whose social 
power was low behaved in no way differ- 
ent from those whose social power was 
high. 

Because the low-lows were different from 
boys with more power or intelligence in 
ways that were quite comparable, and be- 
cause the high-highs were in no way dif- 
ferent from the high-lows or low-highs; 
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it appears that the possession of greater 
intelligence may be the same as the pos- 
session of greater power insofar as the 
effect upon behavior in these groups is 
concerned. 

This similarity is due, we believe, to 
the fact that the contributions made by 
highly intelligent boys in a problem solving 
discussion represent resources which are 
valuable to the group. The possession of 
these resources, we assume, provided 
power for the owners to influence those 
who valued them. It is highly probable, 
although it cannot be demonstrated with 
the data available here, that boys with 
high intelligence were treated by group 
members as though they were persons with 
high power. As a consequence, boys with 
high intelligence became aware of the value 
of their ideas and of the influence they 
were having on the discussion. Although 
they had little power accorded to them 
previously by their classmates, boys with 
high intelligence apparently behaved in 
the groups like those who had come to 
the group with social power already at- 
tributed to them by their peers. 

It is noteworthy that boys low in in- 
telligence yet high in social power tended 
to be more inconsistent than low-lows in 
that they made both demands and weak 
proposals, and praised as well as criticized 
others, whereas boys high in intelligence 
but low in power more consistently pro- 
posed ideas and supported others’ be- 
havior than did the low-lows. The pos- 
sibility is suggested by these findings that 
greater power among boys generated more 
inconsistent and coercive group behavior 
while greater intelligence evoked a con- 
sistency and readiness to be considerate 
in relations with others. This conjecture 
is supported by the findings that peers 
characterized boys with greater power as 
threatening but did not so describe boys 
with greater intelligence. 

Girls who were low-lows were less suc- 
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cessful in influencing their groups and less 
often made positive remarks than girls 
high in power but low in intelligence. This 
indicates that low-low girls, like the low- 
low boys, were passive in their groups’ 
discussions. Low-low girls were not dif- 
ferent in any respect from girls low in 
power and high in intelligence, which sug- 
gests that high intelligence was not simi- 
lar to power among girls, as was earlier 
noted for boys. 

In most combinations of intelligence 
and power, boys were significantly more 
active in seeking to influence others, more 
often successful, more often unsuccessful, 
and more likely to evaluate the comments 
made by others than were girls. Low-low 
boys were very much like girls and most 
like low-low girls, differing from them in 
only two respects: they were significantly 
more likely to be demanding and to be 
positive in commenting upon the con- 
tributions made by others. 

In sum, the possession of either power 
or intelligence by boys appeared to stim- 
ulate vigorous and successful participa- 
tion in their groups’ work while the pos- 
session of low power and low intelligence 
together generated passivity in boys. The 
amount of intelligence or power a young 
person possesses affected a boy’s behavior 
more than it did a girl’s behavior in these 
problem solving discussions. 

To explain the consequences of power 
and intelligence for boys and girls we as- 
sume that these two attributes have a 
differential significance for the sexes in 
allowing them to conform to the expecta- 
tions society puts upon them. Barry, 
Bacon, and Child (1) have reported that 
boys are expected to be self-reliant and 
to strive for achievements, while girls are 
urged to be nurturant, obedient, and re- 
sponsible in almost all societies, including 
ours. 

We assume that, in their group be- 
havior, the boys and girls were attempting 
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to conform to these prescriptions for their 
sexes. Boys who were high in either in- 
telligence or social power more clearly ful- 
filled their sex roles than those lacking 
in both of these. Boys who were low in 
both intelligence and power least often 
showed the behavior required of their sex. 
The low degree of their intelligence and 
power apparently made them unable to 
perform in ways typically expected of 
them. Thus, either social power or in- 
telligence was necessary for boys if they 
were to act as boys are expected to act. 
There is some indication that social power 
was more important than intelligence in 
this respect. 

Girls who were high in power and in- 
telligence were little different from those 
who were low in either of these qualities 
because, we believe, high social power and 
intelligence were not needed in order to be 
the nurturant, obedient, or responsible 
persons required by society. Girls could 
fulfill these expectations regardless of the 
amount of power or intelligence they 
possessed. 


Perceptions by Teachers 


The children who were accorded higher 
social power by their classmates were 
viewed by teachers as most influential, 
since the teachers rated boys and girls 
with higher power as more successful in 
influencing others than those low in social 
power. 

On almost every category, teachers dis- 
tinguished between boys low in power and 
those high in power, regardless of the boys’ 
intellectual abilities. It is striking in this 
connection that teachers saw those with 
greater power as less forceful and more 
friendly than those with less power, 
whereas boys with greater power (and low 
intelligence) were demanding and less 
friendly than those with less power when 
observed in the small groups. Clearly 
teachers did not perceive fhe boys with 
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greater social power as threatening persons 
in the way that classmates saw them. 

In their considerations of girls, appar- 
ently those who were high in either intelli- 
gence or power were seen as more effective 
members of their classes than those low 
in these * respects. 

Why did the teachers characterize boys 
who had greater power (but low intelli- 
gence) as more considerate of others than 
was observed among them in the small 
groups? The most likely explanation is 
that their behavior actually was different 
in their classrooms from what it was in 
the discussions. In a class the teacher is 
in charge so that nondemanding, friendly 
behavior is required and the 
standard way for interacting with peers 
during school. It is also probable that 
teachers approved of achievement ori- 
ented and influential efforts when they 
were used by boys since such behavior is 
in accord with the demands that teachers, 
among other adults, place upon young 
males. Thus, teachers attributed positive 


becomes 


characteristics to the boys whom they 

viewed as most influential in their class- 

rooms. 
In the 


small groups no one was in 
charge. Hence, the boys with greater 
power, but less intelligence, were free to 
insist upon having their opinions accepted 
and were free to use coercion when neces- 
sary in order to be influential. 

Teachers did not perceive differences in 
classroom behavior between boys who 
were high in intelligence and those who 
were low in intelligence although differ- 
ences between these two groups were evi- 
dent in the problem solving discussions. 
It appears that variations in intelligence 
do not generate variations in social be- 
havior in a classroom, of the kind that 
teachers were asked to report. 

We assume that girls were less active 
in their classes and therefore less visible 
to the teachers than the boys. Thus, 
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teachers made fewer distinctions in char- 
acterizing the behavior among girls with 
different amounts of power than they did 
among boys. Girls with higher power were 
seen by teachers as most, friendly to class- 
mates, which suggests that through this 
friendliness they won influence in the 
classroom. Girls with higher intelligence 
were also seen by teachers as being more 
friendly and as making more attempts to 
influence others than girls with low in- 
telligence. It is probable that the girls 
with high intelligence did use acceptable 
forms of social influence in their school- 
room relations since they were rated as 
attractive by their classmates, more so 
than those with low intelligence. This ac- 
ceptance by their peers apparently gave 
them confidence to exercise their influence 
freely and teachers noted this in charac- 
terizing highly intelligent girls. 

In summary, teachers perceived the be- 
havior of girls as pretty much alike regard- 
less of their power or intelligence. They 
made no significant distinctions among 
boys with different degrees of intelligence, 
but they saw many distinctions in the 
behavior of boys who differed in social 
power. Assuming that the teachers’ per- 
ceptions are accurate, it is evident that a 
boy’s social power determines his be- 
havior in the classroom more than his 
intelligence does, whereas differences in 
the power or intelligence of a girl has 
little effect upon the behavior she employs 
in the schoolroom. 
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The present article considers the idea 
that college cultures may be seen as a 
complex of environmental press which, in 
turn, may be related to a corresponding 
complex of personal needs. In the psycho- 
logical literature, one is indebted to Henry 
Murray (5) for the dual concept of per- 
sonal needs and environmental press. In 
the broadest sense, the term “need” refers 
to denotable characteristics of individuals, 
including drives, motives, goals, ete. The 
term “press” can similarly be regarded as 
a general label for stimulus, treatment, or 
process variables. Murray's concept of 
needs has provided a starting point for 
the construction of various objective 
measures of personality (2, 3, 4). No 
parallel development in the objective 
measurement of environmental press, how- 


ever, has previously been attempted. Col- 
lege students differ. College environments 
also differ. The concept of press offers a 
way of viewing the environment which is 
comparable analytically and synthetically 
to the more familiar ways of dealing with 
the individual. 


DEVELOPMENT OF THE COLLEGE 
CHARACTERISTICS INDEX 


Using Murray's classification of needs 
as a model, Stern has constructed several 
experimental editions of a needs inven- 
tory, called the Activities Index. In its 
current form the Activities Index con- 
sists of 300 statements of commonplace, 
socially acceptable activities to which re- 
sponses of “like-dislike” are given. There 
are 30 scales of 10 items each, correspond- 


*Portions of this research have been 
financed by grants from the College En- 
trance Examination Board and the Carnegie 
Corporation. 


ing to 30 needs in Murray’s taxonomy. 
Some scales can be scored positively or 
negatively, as for example conjunctivity- 
disjunctivity, succorance-autonomy, im- 
pulsion-deliberation, etc., so that the total 
number of needs to which scores can be 
attached is 42 rather than 30. A pre- 
liminary manual for the Activities Index 
describes the test in detail (8). 

A corresponding test for describing col- 
lege environments, called the College 
Characteristics Index, was subsequently 
constructed. It consists of 300 statements 
about college environments to which re- 
sponses of “True-False” are given. The 
statements are organized into 30 ten-item 
scales, with a press scale for each need 
scale that was included in the Activities 
Index. The following kind of questions 
guided the writing of items: what might 
be characteristic of an environment which 
exerted a press toward order, or toward 
autonomy, or toward nurturance, or un- 
derstanding, or play, etc? Stated in an- 
other way, what might there be in a 
college environment which would be satis- 
fying to or tend to reinforce or reward 
an individual who had a high need for 
order, or autonomy, or nurturance, or 
understanding, or play, ete.? The items 
themselves are statements about college 
life. They refer to the curriculum, to col- 
lege teaching and classroom activities, to 
rules and regulations and policies, to 
student organizations and activities and 
interests, to features of the campus, etc. 

Sample items from corresponding Need 
and Press scales will illustrate the par- 
allelism. 

A need for Order would be inferred from 
liking such activities as: “Arranging my 
clothes neatly before going to bed. Hav- 
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ing a special place for everytiing and 
seeing that each thing is im its place. 
Keeping a calendar or notebook of the 
things I have done or plan to do.” What 
might such a person like to find in a col- 
lege environment or what features of a 
college environment might be rewarding 
or frustrating to such a need? The fol- 
lowing items from the press scale for 
Order might be relevant: “Faculty mem- 
bers and administration have definite and 
clearly posted office hours. In many ela-ses 
students have an assigned seat. Profes- 
sors usually take attendance in class.” 
On the need seale for Impulsion-De- 
liberation, a high score for Impulsion re- 
sults from liking such activities as: “Being 
in a situation that requires quick decisions 
and actions. Doing things on the spur 
of the moment. Doing whatever I’m in the 
mood to do.” Thus, a college environment 
which has a press toward impulsiveness 
might be a place where: “Most students 
don’t decide what courses to take until 
the time of registration. Students often 


start projects without trying to decide in 
advance how they will develop or s’here 


they may end. Spontaneous stude: + ral- 
lies and demonstrations occur frequently.” 
A high need for Energy is inferred from 
liking such activities as: “Taking up a 
very active sport. Having something to 
do every minute of the day. Giving all 
of my energy to whatever I happen to’ be 
doing.” The needs of such a person might 
be expected to find fulfillment and satis- 
faction in a college environment where: 
“There is an extensive program of intra- 
mural sports and informal athletic ac- 
tivities. Student gathering places are typi- 
cally active and noisy. Class discussions 
are typically vigorous and intense.” 
Just as needs are inferred from the 
characteristic modes of response of an in- 
dividual, so press are reflected in the 
characteristic pressures, stresses, rewards, 
conformity-demanding influences of the 
college culture. Operationally, press are 


the characteristic demands or features as 
perceived by those who live in the par- 
ticular environment. To each statement in 
the College Characteristics Index the per- 
son who takes the test answers true if he 
believes :t is generally characteristic of 
the college, is something which occurs or 
might occur, is the way people tend to 
feel or act; and he answers false if he 
believes it is not characterisic of the col- 
lege, is something which is not likely to 
occur, is not the way people typically 
feel or act. 


A Srupy or Five Institutions 


A first draft of the College Characteris- 
tics Index was administered in May, 1957, 
to groups of students at five institutions 
and to smaller groups of faculty members 
at four of the five institutions. In all, 423 
students and 71 faculty members re- 
sponded to the instrument. Neither stu- 
dent nor faculty groups were representa- 
tive samples. Most of the.students were 
upperclassmen and most of the faculty 
members were in the upper academic 
ranks. It was argued that if a dominant 
press really exists in a particular environ- 
ment almost any group of people living 
in the environment would probably 
identify it. The testing program was, in 
any case, intended only as a preliminary 
try-out of the model from which some 
information would be gained about the 
types of items, the possible reliability 
and validity of the scales, and the poten- 
tial utility of this approach to measuring 
college characteristics. 

The five institutions, although not iden- 
tified here, were selected because observers 
would probably agree that they are rather 
different from one another, with the selec- 
tion of colleges thus providing some evi- 
dence about the construct validity of the 
test. One was a large Midwestern state 
university. The second was a large Mid- 
western private university. The third in- 
stitution was a large Eastern private uni- 
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versity. The fourth was a moderate-sized 
Eastern private college for men. The fifth 
institution was a publicly supported col- 
lege in the metropolitan New York area. 


Test Results 

Saying that a particular press is or is 
not characteristic of an institution is an 
arbitrary matter. There exist no conven- 
tions or experience to guide the decision. 
The basis for the tentative decisions that 
were made, together with other statisti- 
cal information about the scales and the 
items are given in the following para- 
graphs. 

In examining means, or a profile on 
which mean scores are plotted, one natu- 
rally looks for what appear to be high 
and low points. And in examining variances 
of distributions, in the context of press 
identification, one naturally looks for 
some concentration of scores or a low 
variance, suggesting a consistency of im- 
pression, rather than a wide dispersal of 
scores, suggesting divergent impressions 
about the press. 

Table 1 shows the means and standard 
deviations on each of the press scales, 
computed from the students’ responses at 
the five institutions studied. Each of these 
30 scales has a maximum possible range of 
0 to 10. The median of these mean scores 
is approximately 5.5. The median of the 
standard deviations is approximately 1.7. 
Putting this information together, one 
might suggest that, for the five institu- 
tions represented, a fairly reasonable defi- 
nition of a noticeable press (or its ab- 
sence) would be one which required a mean 
score falling in the upper or lower one 
fourth of the total distribution. Mean 
scores of 6.6 or higher and mean scores 
of 44 or lower would thus be suggestive 
of a press. In Table 1, means which meet 
this criterion are in italics. 

A corresponding table showing faculty 
means and sigmas is not presented, mainly 
because the number of cases is quite 


small—Ns being 25, 20, 11, and 15. Also, 
from an educational view, it may be ar- 
gued that the effective press of an environ- 
ment is what the students say it is, not 
what the faculty say, or what the cata- 
logue says, etc. Nevertheless, in the four 
colleges where both student and faculty 
responses to the CCI were obtained it is 
of some pertinence to note their similar- 
ity. Table 2 shows a distribution of the 
differences between the means of stu- 
dents’ and faculty responses in each in- 
stitution and also the differences between 
faculty and student responses to indi- 
vidual items. Here it is evident that most 
of the differences are fairly small. In 
nearly half of all the comparisons between 
faculty and student mean scores the dif- 
ference between the two was less than 
half a point. Three fourths of all the dif- 
ferences were less than 1.00. Differences 
between faculty and student responses to 
individual items are grouped in three ar- 
bitrary categories. Thus, 44% of all the 
items answered by students and faculty 
were answered within a range of agree- 
ment of 10 percentage points or less; and 
on 12% of the items the percentage for 
the students’ answer differed by 30 points 
or more from the percentage for the 
faculty’s answer. Considering the number 
of cases in all groups, one can be confident 
that differences of 10 points or less are 
in all instances merely chance differences, 
and that differences of 30 points or 
greater may in most instances be signifi- 
cant differences at least at the 5% level 
of confidence. It appears then, that some 
of the differences in the middle category 
of the table are chance differences and 
some are significant differences. One might 
estimate that, over all, about three fourths 
of the items were answered in tolerably 
good agreement by both students and 
faculty, and that perhaps as many as one 
fourth of the item responses represent 
divergent views between students and 
faculty in characterizing the institution. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS ON THE VARIOUS Press SCALES FROM 
Stupents’ Responses aT Five Institutions 
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(N = 100) 


College B 
(N = 44) 
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Reliability 

Test-retest data are not available in the 
present study. Conventional estimates of 
reliability from a single administration 
may not be appropriate for an instrument 
on which one hopes to find skewed dis- 
tributions and minimal dispersion of 
scores. Faculty-student agreement within 
the same institution is of some relevance 


: Natural Sciences. 


but one might argue that the perceptions 
of these two groups could differ on indi- 
vidual items or scales, yet each could be 
reliable within itself. 

Item analysis data are more directly 
relevant to the reliability of scales. An 
item analysis was made of each scale, 
separately for each of the five institutions, 
using the students’ responses. Ebel’s (1) 
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simplified method was used. Each of the 
30 seales was thus subjected to item 
analysis in five different samples, and since 
there are 10 items in each scale, the total 
number of item discrimination indexes 
obtained was 1500. Of these 1500 dis- 
crimination indexes, 1% were negative, 
18% fell between .0 and .19, 30% fell 
between .20 and 39, and 51% were 40 
or higher. In other words, 81% of the 
items had, on the average, moderate to 
high discrimination in their respective 
scales. 

Perhaps the most important approach 
is one which treats reliability and validity 
as inseparable and deals with the in- 
strument as a whole. For example, do 
different people characterize the institu- 
tion in the same way? This involves the 
reliability of profiles, with all their inter- 
relationships. As a first approximation of 
this, the rank order of mean scores from 
the students’ responses can be compared 
with the rank order of mean scores from 
faculty responses within the same institu- 
tion. Thus, do these groups see the insti- 
tution in relatively the same pattern? For 
the two colleges (Colleges B and C) which 
had the largest number of faculty re- 
spondents, these rank order correlations 
were .96 and 88. Correlations were not 
computed for the other two colleges where 
faculty mean scores would be based on 
Ns of 15 and 11. 

Validity 

To illustrate the sort of interpretation 
and description of a college environment 
which can be derived from the CCI, the 
press of two colleges are presented. The 
comments are based entirely on the ar- 
bitrary definitions of what levels of scores 
constitute a press (which was explained 
earlier); and the nature of these press is 
further illustrated by citing some of the 
specifle items which most clearly define 
them. No estimate is presently available 
of the validity of these descriptions against 
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TABLE 2 


Summary or DirrerReENcEs BETWEEN 
Facutty aNnp Strupent REsPoNsEs 
WITHIN Eacu or Four 
INSTITUTIONS 
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a systematic outside criterion. But the 
descriptions show quite clearly that these 
are very different environments, and that 
the test is therefore capable of revealing 
some sharp distinctions between colleges 
which qualified observers would expect to 
be different. The evidence is therefore 
relevant to the property of validity. De- 
tailed data, and descriptions of the other 
three colleges, may be found elsewhere 
(7). 


College A. 


The major press in College A are toward 
orderliness and friendly helpfulness, with 
overtones of spirited social activity. This 
is suggested by high scores on the scales 
for Order, Objectivity, Conjunctivity, Nur- 
turance, Play, Ego Achievement, Exhibi- 
tionism, and by low scores on the scales 
for Abasement, Impulsion, and Aggression. 

The stress on Order, Deliberation (op- 
posite of Impulsion), and Conjunctivity 
is indicated by such highly shared ob- 
servations as the following: students have 
assigned seats in some classes, professors 
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often take attendance, papers and reports 
must be neat, buildings are clearly marked, 
students plan their programs with an ad- 
viser and select their courses before regis- 
tration, courses proceed systematically, it 
is easy to take clear notes, student ac- 
tivities are organized and planned ahead. 

Within this orderliness, student life is 
spirited and a center of interest. For ex- 
ample, big college events draw lots of 
enthusiasm, parties are colorful and lively, 
there is lots to do besides going to classes 
and studying, students spend a lot of time 
in snack bars and in one another’s rooms, 
and when students run a project everyone 
knows about it. 

At the same time, amid this student- 
centered culture, there is a stress on 
idealism and service. Students are expected 
to develop an awareness of their role in 
social and political life, be effective citi- 
zens, understand the problems of less 
privileged people, be interested in chari- 
ties, etc. 

The total picture of the environment, 
then, is one of high social activity, esprit 
de corps, and enthusiasm combined with 
an emphasis on helping others and idealis- 
tic social action and all within a fairly 
well understood set of rules and expecta- 
tions which are deliberative and orderly. 
One would expect some of the explicit 
objectives of such an institution to stress 
personal and social development, idealism 
and social action, and civie responsibility. 


College B. 


Here the dominant press of the environ- 
ment fall in the theoretical-intellectual 
category—Reflectiveness, Humanism, Sci- 
entism, Understanding, and Objectivity. 
This dominant press occurs in an environ- 
ment also charaeterized by Change, non- 
defensive acceptance of criticism (Adap- 
tiveness), and by resistance to any abject 
aeceptance of criticism or presumed low 
status (Abasement). Moreover, on two of 
the seales which defined a high press at 
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College A (Play and Order), the press at 
B is exactly opposite. There is further, a 
minimum of importance to social status 
or manipulation for tangible ends (Prag- 
matism), preoccupation with self and per- 
sonal appearance (Narcissism), and boss- 
ing or directing others (Dominance). 
There is, however, a generally consistent 
and high press toward Deliberation or 
planning and thinking ahead. 

It is clear that the most pervasive press 
is directed toward the pursuit of under- 
standing for its own sake, abstract and 
unencumbered by requirements for prac- 
tical utility or social action. 

The theoretical-intellectual press of the 
environment at College B is more spe- 
cifically suggested by the following ob- 
servations with which, generally, more 
than nine tenths of both faculty and stu- 
dents agree: there are excellent library 
resources in natural science and social 
science, a lecture by an outstanding phi- 
losopher or scientist would draw a capac- 
ity audience, many students are planning 
graduate work or careers in science or 
social science, there are many opportuni- 
ties for students to see and hear and criti- 
cize modern art and music, reasoning and 
logic are valued highly in student reports 
and discussions, students who spend a lot 
of time in a science laboratory or in try- 
ing to analyze or classify art or music or 
in seeking to develop a personal system 
of values are not regarded as odd, scholar- 
ship and intellectual skills are regarded 
as more important than social poise and 
adjustment, there is time for private 
thought and reflection, one need not be 
afraid of expressing extreme views, the 
faculty and administration are tolerant 
and understanding in interpreting regula- 
tions. 

In contrast with College A, students at 
B do not have an assigned seat in class, 
professors do not take attendance, students 
are likely to study over the weekend, 
big college events draw no great enthusi- 
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asm, and the place is not described as 
one where “everyone has a lot of fun.” 

Moreover, student leaders have no 
special privileges, family status is not im- 
portant, students are not much concerned 
about personal appearance and grooming, 
and an intellectual is not an “egghead.” 

And finally, exams are not based on 
factual material from a textbook, classes 
are not characterized by recitation and 
drills, grade lists are not publicly posted, 
students are not publicly reprimanded for 
mistakes, student organizations are not 
closely supervised, students tend to stay 
up late at night, work all the harder if 
they have received a low grade, and if 
confronted with a regulation they do not 
like they will try to get it changed. 

One would expect the explicit objectives 
of such an institution to stress the aequi- 
sition of knowledge and theory, critical 
judgment and independence, and a sense 
of the significance of intellectual life. 


Other Differences Between Colleges 


That College A and College B are quite 
different environments can be suggested 
statistically as well as in the descriptive 
profiles just presented. The rank order of 
mean scores for College A correlates, for 
example, .06 with the rank order of mean 
scores for College B. One can also esti- 
mate, without computing, the significance 
of the differences between the mean scores 
of College A and College B on each of the 
30 seales. Given Ns of 100 and 44, and 
standard deviations as large as 2.00, a 
difference of 1.00 is significant well beyond 
the 1% level of confidence. On at least 22 
of the 30 scales, the differences between 
means for the two schools are significant. 
Considering all five schools, it is clear that 
on every scale except one (Deference) 
there are significant differences between 
two or more of the mean scores. 

Another indication of differences can be 
obtained from noting the differences be- 
tween the percentages of students at Col- 


leges A and B answering each item ac- 
cording to the key. In the middle range 
of percentages a difference of 20 points, 
with Ns of 100 and 44, is always signifi- 
cant beyond the 1% level. On this con- 
servative basis there were 172 of the 300 
items on which the percentages for Col- 
lege A differed significantly from College 
B. 

The point to these observations is 
merely to suggest that the first trial run 
of the CCI produced many results which 
clearly differentiated among the environ- 
ments or press of the five colleges. The 
actual items in the CCI are not repro- 
duced here because there seems little 
virtue in printing a first draft. The con- 
tent of many items has been, of course, 
indicated in the descriptive profiles of the 
two colleges and in the earlier paragraphs 
which noted the parallel structure of 
Need items and Press items. 


CONCLUSIONS AND IMPLICATIONS 


After completing the preliminary stud- 
ies reported thus far, a revised form of the 
College Characteristics Index was pro- 
duced in which 58% of the original items 
were retained, 13% were slightly modified, 
and 29% of the items were new. The re- 
vised form, at the time of writing this 
article, has been administered to approxi- 
mately 1200 students distributed among 
more than 30 institutions. Further re- 
search and more intensive analyses have 
been planned. Before commenting on spe- 
cifie plans, however, certain broad values 
and implications in this psychological ap- 
proach to the measurement of college 
environments are suggested. 

One potential value, for example, is in 
institutional self-analysis. Administrators 
and faculty members should be able to 
learn something useful about the dynamics 
of the college environment from studying 
students’ responses to the College Charac- 
teristics Index. Institutional press should 
have some clear relationship to institu- 
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tional purpose. The objectives of a col- 
lege are formal or explicit statements of 
intent: they indicate the directions in 
which a college means to influence the be- 
havior of students. They find expression 
in curricula, practices, services, policies, 
and other aspects of the college environ- 
ment. The press, as measured by the 
method described, constitute what Stern, 
Stein and Bloom (9) have referred to as 
an operational definition of objectives, or 
the implicit influence of the environment 
upon the students. Implicit press and 
explicit objectives should reinforce one 
another, for an institution should operate 
in reality the way it means to operate in 
theory. Consequently, a serious lack of 
congruence between implicit press and ex- 
plicit objectives would suggest to faculty 
members and administrators that certain 
aspects of the environment ought to be 
changed in order to make the total impact 
of the institution more consistent or more 
effective. Pace (6) has commented else- 
where on the disintegrative effect of dis- 
crepancy between stated objectives and 
actual practices. 

Some aspects of an environment can be 
changed more readily than others. The 
College Characteristics Index provides 
some direct indications of the psychologi- 
cal implications of various policies and 
practices. Roughly, one fourth to one 
third of the items in the Index state spe- 
cific practices which an administration or 
faculty could more or less easily change 
if they did not like the implications. For 
example, being able to drop a course in 
which one is having difficulty, or to sub- 
stitute another course for one which has 
been failed, is associated with Counter- 
action; insisting that students’ reports or 
papers be neat, or giving students an as- 
signed seat in class, or taking attendance 
regularly, is associated with Order. As 
the relationships among press variables 
and between these variables and institu- 
tional objectives as well as personal needs 
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are established, the significance of such 
specific practices will become clarified. 
Other items in the Index are more indirect 
in their implications about the effect of 
various policies or practices. But the clues 
can be investigated and can thus be the 
starting point for serious discussions about 
the impact of the environment on the 
student and the relation of this impact 
to the intended objectives. 

Another set of implications from this 
approach to describing college environ- 
ments relates to the problem of assess- 
ment and prediction. Assessment studies 
have often failed because the situations 
or environments in which assessed modes 
of behavior were supposed to occur have 
been inadequately described or differenti- 
ated. The interaction between person and 
environment was not successfully pre- 
dicted because the environment was not 
measured as analytically and systemati- 
cally as the person. The whole field of 
college prediction studies provides a good 
example. The criterion is typically a 
grade point average. No fundamental im- 
provement in predicting against this cri- 
terion has been made in the last 25 years. 
Prediction studies should be concerned 
with performance in the environment as a 
whole. The complexity of relationship be- 
tween person at‘ en,irenment is in- 
evitably obscured wy the simplified and 
often inappropriate symbolism of correla- 
tion between scholastic aptitude test and 
grade point average. The press of a col- 
lege environment represents what must 
be faced and dealt with by the student. 
It is possible that the total pattern of 
congruence between personal needs and 
environmental press will be more pre- 
dictive of achievement, growth, and 
change than any single aspect of either 
the person or the environment. 

It will be a long time before admissions 
officers or guidance counselors can benefit 
from the results of these more complex 
analyses. This requires the establishment 
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of known relationships between kinds of 
persons and kinds of environments. But 
conceivably, advisers within an institution 
may be better able to help students find 
an effective and rewarding role within the 
operative environment of the college, or 
to see more clearly the ways in which en- 
vironments need to be modified if differ- 
ent kinds of students are to grow within 
them most effectively. 

As further steps toward refining and ex- 
ploring the potential of this needs-press 
concept, as exemplified by the Activities 
Index and the College Characteristics In- 
dex, the following studies, among others, 
are in progress: 

1. Statistical studies of the instrument 
(CCI), including test-retest reliabilities, 
correlation matrix of all scale scores, fac- 
tor analysis, item covariances. 

2. Comparative studies of types of 
items, subjective or impressionistic versus 
relatively objective ‘or factual. 

3. Studies of the relations between stu- 
dents’ needs scores and the corresponding 
perception of press in the environment. 

4. Analysis of perception of press among 
various sub-cultures within a complex in- 
stitution. 

5. Significance of congruence between 
Need and Press in determining successful 
performance and/or satisfaction in the 
college environment. 


SUMMARY 


The present article has suggested that a 
college environment may be viewed as a 
system of pressures, practices, and poli- 
cies intended to influence the development 
of students toward the attainment of im- 
portant goals of higher education. The first 
draft of an instrument for measuring these 


influences systematically has been con- 
structed. Data analyzed thus far reveal 
significant differences in the press of dif- 
ferent college environments. The instru- 
ment itself, which has subsequently been 
revised, appears to be promisingly reli- 
able and valid. In the long run, research 
which this type of instrument makes pos- 
sible should increase understanding of 
the ways in which institutions make their 
impact upon students and provide a 
broader conceptualization for evaluating 
the effectiveness of higher education. 
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In 1931, 212 children in the San Fran- 
cisco Bay Area between the ages of 2 and 
5/2 years were given tests which later con- 
stituted Forms L and M of the 1937 Re- 
vision of the Stanford-Binet. Careful meth- 
ods were developed and adhered to in the 
selection of these children to avoid biased 
sampling and to insure a representative 
group, because they were the California 
sample of the nationwide standardization 
of this revision (9, pp. 12-15, 18; 7, pp. 
6-7, 36-37). : : 

Ten years later, in 1941, 138 of these 
children still in the area were adminis- 
tered Form L of the Stanford-Binet.’ 
Thirteen of the original group were not 
included because as two-year-olds they 
had missed so many items at the lowest 
test level that the obtained mental age 
was & maximum expression of their intelli- 
gence, the true estimate not being de- 
terminable. Another 61 could not be lo- 
cated. 

In 1956, 111 of the 1941 group were 
located and given Form L of the Stanford- 


? This investigation was supported by Re- 
search Grant M-1273 from the National 
Institutes of Health, Public Health Service. 

*At Stanford University when partici- 
pating in this research. 

*The authors wish to express their debt 
of gratitude to Maud A. Merrill and the 
late Lewis M. Terman for having made 
their standardization data available for fol- 
low-up. They also wish to express their ap- 
preciation for the continued interest of 
Quinn McNemar, who has served as con- 
sultant for both follow-ups. 


Binet and the Wechsler Adult Intelligence 
Seale. Nine Ss, although located, refused 
to participate; one S was in a mental hos- 
pital and not accessible for testing; an- 
other S was located in a community where 
no provisions could be made for testing; 
the remaining Ss could not be located. A 
detailed report of the results of this fol- 
low-up and their theoretical implications 
is in the process of preparation. This is 
being interrupted now to present a sum- 
mary of the actual IQ changes over the 
25-year period because of the pending ap- 
pearance of a new revision of the Stan- 
ford-Binet* and the appropriateness of 
having these retest data on the standard- 
ization group available at the same time. 


SUBJECTS AND PROCEDURE 


The Ss in the present study were ad- 
ministered Forms L and M of the Re- 
vised Stanford-Binet Scale 25 years pre- 
viously, in 1931, in connection with its 
standardization. They were also examined 
with Form L of this same scale in 1941 
(3, 4, 5). No interim contacts had been 
made. A comparison of the 1941 retest 
groiip of 138 Ss with the total standard- 
ization group at these ages showed some 
upward selection, as the initial mean com- 
posite IQ of the retest group was 109.2 
compared with 105.4 for the standardiza- 
tion group. Most of this difference was 
concentrated in the younger age groups, 


‘Personal communication from Maud A. 


Merrill. 
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where the children for whom the test was 
too difficult had been eliminated. The dis- 
tributions of paternal occupational level 
for the retest and standardization groups 
were similar at all ages. 

The loss of Ss between the 1941 and 1956 
testings resulted in further selection with 
respect to initial mean IQ, which rose to 
112.8 for the 1956 group with the elimina- 
tion of 27 Ss, 25 of whom were not lo- 
cated or refused to participate. Apparently 
availability for follow-ups was positively 
related to intelligence, although again the 
greatest discrepancy was for the group 
which had been two years of age at the 
initial testing. 

In the current testing of 1956, Form L 
of the Revised Stanford-Binet and the 
Wechsler Adult Intelligence Seale were 


administered. (In one case the WAIS was 
incomplete. In another the 1931 IQ is not 
valid; this case was included only in com- 
parisons with the 1941 results.) Of the 111 
Ss, 98 were tested by Cravens; two were 
tested by Bradway, who had administered 


all tests in the 1941 follow-up; the re- 
maining 11 Ss were tested by psychologists 
at colleges and universities located near 
the Ss’ current place of residence. Mo- 
bility was less than had been anticipated: 
of the 122 Ss located, only 24 lived more 
than 40 miles from where they had lived 
in 1981. 

The retest group was comprised of 52 
men and 59 women. This is the same pro- 
portion as in the 1941 testing. Inasmuch 
as no sex differences in test results were 
found in the standardization (9, p. 34), 
the results in the present study are given 
for the total group, combining both sexes. 


*The authors wish to acknowledge their 
indebtedness to the following psychologists 
who participated in this way: Steven K. 
Abe, Walter H. Brackin, Maurice Deigh, 
Robert Gray, George S. Ingebo, W. B. 
Lemmon, Jeanne Reitzell, Gordon and Vir- 
ginia Riley, Alwyn Sessions, Paul 8. Spitzer, 
and Helene R. Veltfort. 
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TABLE 1 
OBTAINED STanrorp-Binet IQs By YEAR 





1931* 
1941» 
1956> 


112.8 

112.3 

123.6 
* Composite Form ar Form M. 3 
> Form L. 





REsULTsS 


The degree of relationship between the 
initial composite IQ (Forms L and M) of 
these Ss when they were in the age range 
from 2.0 to 5.5 (mean age 4.0) and Form 
L IQ when they were in the age range 
from 26.5 to 32.2 (mean age 29.5) is ex- 
pressed by a Pearsonian r of .59. This com- 
pares favorably with the r of .65 found 
for the same group in the first follow-up 
after only 10 years. The correlation be- 
tween the 1941 and the 1956 testings is 
35. 

All but 19 of the 110 Ss showed a higher 
IQ on the 1956 testing than they did in 
1931. This was not limited to any one seg- 
ment of the IQ range. The mean increase 
was 10.8 points. The results in Table 1 
show that this increase occurred between 
1941 and 1956, that is between the early 
adolescent years and adulthood, since the 
mean IQs in 1931 and 1941 were approxi- 
mately the same. The mean IQ increase 
from 1941 to 1956 was 11.3 points. The 
increases from 1931 to 1956 and from 1941 
to 1956 are, of course, highly significant 
statistically with CRs of 8.1 and 13.7 re- 
spectively. The 1956 WAIS mean IQ of 
108.9 + 11.0 more nearly approximates 
the 1931 and 1941 Stanford-Binet mean 
IQs than does the 1956 Stanford-Binet." 


*The correlational values for the 1956 
WAIS for this group are similar to those for 
the 1956 Stanford-Binet. Pearsonian r with 
1931 Stanford-Binet is 64; with 1941 it is 
80. The correlation between the two 1956 
tests is 83. 
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These data are interpreted as invalidat- 
ing the assumption that intelligence stops 
increasing at 16 years, on which was based 
the standard computation of an adult 
Stanford-Binet IQ. This is consistent with 
the findings of other recent investigators. 
Bayley (1) in the Berkeley Growth Study 
found increases in Stanford-Binet scores 
of the same Ss up to the age of 17 years 
(the most recent administration), and on 
the Wechsler-Bellevue up to the age of 21 
years (most recent administration). More- 
over, a few 25-year scores available at the 
time of writing indicated that the ceiling 
of mental growth had not yet been reached 
by these Ss. The interpretation of these 
results, of course, must take account of 
the possibility of practice effects from fre- 
quent retesting. Owens (8) found an in- 
crease on Army Alpha scores by Ss who 
took the test initially at 19 years and again 
at 50 years. Bayley and Oden (2) in the 
most recent examination of Terman’s 
Gifted Children found increases in scores 
on the Concept Mastery Test for Ss cov- 
ering a total range from age 20 to age 50 
years, retested after a 12-year interval. 

It seems obvious that a correctional fac- 
tor of some sort should be applied to the 
adult Stanford-Binet IQs. Decision on the 
nature of such a correction for purposes of 
analyzing data of the present study is be- 
ing deferred until the pending publication 
of the new revision. It may be noted, how- 
ever, that whereas the IQs of 42% of the 
present group changed more than 10 points 
from 1931 to 1941, the IQs of 60% changed 
more than 10 points from 1931 to 1956. If 
the IQ is to be meaningful as an index, 
the average gain should be zero as it was 
between 1931 and 1941. When the IQs 
are equalized by taking account of the 
mean increase in IQ and subtracting 11 
points from the 1956 value, the frequency 
of changes greater than 10 points is re- 
duced to 42%. Similarly, 22% of the pres- 
ent group changed more than 15 points 
from 1931 to 1941, whereas 41% changed 
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more than 15 points from 1931 to 1956. 
Equalizing the IQs as described above, the 
latter is reduced to 22%. From 1941 to 
1956 the equalizing of the values reduces 
those changing more than 10 points from 
58% to 18% and those changing more than 
15 points from 28% to 7%. 
DiIscussION 

Beyond the problem of determining the 
optimal method to use in computing adult 
indices of intelligence is the question of 
what implications these data have for the 
nature of the mental growth curve and the 
theoretical question of terminal age of in- 
tellectual growth. A choice of method to 
use in arriving at the most useful index of 
adult intelligence requires the considera- 
tion of many factors. Wechsler’s use of 
IQs computed by setting a mean of 100 
and a standard deviation of 15 for each 
age is one solution. The central problem, 
however, is that intelligence, like so much 
of human behavior, is multifaceted. The 
point of view from which the question is 
approached depends upon what aspect is 
to be explored. Are we interested in an in- 
dex showing one’s place relative to others 
of the same age group, an index of one’s 
location on a scale of development and de- 
cline, an assessment of one’s various kinds 
of mental processes? Inasmuch as differ- 
ent abilities reach maturity at different 
ages and begin to decline at different ages, 
the particular abilities examined become 
deciding factors. For example, Corsini and 
Fassett (6) found from the testing of 1072 
adults on the Wechsler-Bellevue that gen- 
eral intelligence declines from early to 
late maturity only if visual and motor 
factors are tested, and increases if the 
items are dependent on continued learn- 
ing. We have in progress further research 
directed at determining what factors are 
associated with fluctuation in measured 
intelligence throughout the life span and 
the course of development and decline of 
different kinds of intelligence. 
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SuMMARY 

Of the 212 preschool children living in 
the San Francisco Bay Area who composed 
part of the standardization group of the 
1937 Revision of the Stanford-Binet, 111 
were retested as adults 25 years later. The 
sample was somewhat biased, with a tend- 
ency for Ss at the lower IQ levels to be 
more difficult to locate or to refuse to par- 
ticipate. Thus the initial mean IQ of those 
retested was 112.8 as compared with 105.4 
for the total group. 

The Pearsonian r between adult and 
preschool Stanford-Binet IQs over the 25- 
year period is 59. The mean IQ showed a 
rise of 10.8 points in 25 years. This rise 
occurred in the years after early adoles- 
cence, there being no increase in mean IQ 
between preschool and adolescent testings. 
This is interpreted as invalidating the 
assumption that intelligence stops increas- 
ing at 16 years. Further research is in 
progress. 
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The hypothesis that learning through 
independent discovery is superior to learn- 
ing by rote is well supported by existing 
research evidence (5, 6, 7) More recently, 
however, evidence has been published 
which suggests that atiempts to direct 
the learner in the discovery process may 
also be successful without loss in retention 
or transfer (2, 3). 

One reasonable explanation for the su- 
periority of both the independent discov- 
ery and the directed discovery processes 
is that learning under either condition is 
more meaningful than in the case where 
the learner simply memorizes answers. 
Meaning is used in the following report 
in the cognitive sense of understanding or 
organization. More precisely, through the 
discovery process, in which the learner is 
forced to rely on his own cognitive capaci- 
ties, he becomes cognizant of the relation- 
ships of the learning task to his previous 
experience, or to the pattern of relation- 
ships among the elements of the task. The 
superiority of such meaningful learning 
over rote learning is also well supported by 
research evidence (1, 4). 

If meaningful learning is the key con- 
cept, it should make no difference whether 
learning occurs with or without direction, 
so long as the learner becomes cognizant 
of the essential relationships. However, it 
is very likely that some procedures of 


*This study was supported by research 
grants from The Graduate School of the 
University of Oregon, Eugene, Oregon. Re- 
search assistance was ably provided by 
Harold K. Kozuma and Arthur O. Motz. 
An abbreviated version of this report was 
read at the joint meeting of the AERA and 
CERA in San Francisco, March 1958, under 
a different title. 


learning may be superior to others simply 
because they are more likely to cause the 
learner to become cognizant of the rela- 
tionships. 

The purpose of this research was to 
study the process of learning tasks involv- 
ing arithmetical and geometrical relation- 
ships in order to determine whether or not 
the superiority of the discovery and d*- 
rected-discovery procedure is adequately 
explained in terms of “meaningful learn- 
ing,” and, if not, to discover a more ade- 
quate explanation. 


DESCRIPTION OF THE TASKS 


Each S had the task of learning the 
following two rules of addition. 

1. The Odd-numbers rule. The sum of any 
series of consecutive odd numbers beginning 
with one is equal to the square of the num- 
ber of figures in the series. (For example, 
1, 3, 5, 7, is such a series; there are four 
numbers, so 4 times 4 is 16, the sum.) 

2. The Constant-difference rule. The sum 
of any series of numbers in which the dif- 
ference between the numbers is constant 
is equal to one-half the product of the 
number of figures and the sum of the first 
and last numbers. (For example, 2, 3, 4, 5, 
is such a series; 2 and 5 are 7; there are 
four figures, so 4 times 7 is 28; half of 28 
is 14 which is the sum.) 


These rules can be learned by simply 
memorizing the task procedure as above. 
On the other hand, the learner can become 
cognizant of certain relationships to geo- 
metrical and arithmetical concepts which 
the two rules involve, in which case his 
learning will be more meaningful. 

Consider first the Odd-numbers rule. In 
the first place, it is possible to relate the 
arithmetical concept of “squaring a num- 
ber” to the geometrical concept of 
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“square.” For example, "may be con- 
ceptualized as the arithmetical counter- 
part to the geometrical concept of a square 
having 4 units to the side. 

Now, it can readily be shown that any 
series of odd numbers beginning with one 
can be rearranged in the form of a geo- 
metric square, as is illustrated in Fig. 1. 
In Fig. 1, each row of X’s represents a 
number—the first row is “1,” the second 
row is “3,” and so on. The X’s which are 
left outside the square can be fitted into 
the spaces indicated by the dashes inside 
the box. 

The rule can also be related to the con- 
cept of the arithmetical average. The mean 
of any such series is the middle number; 
and the mean is always equal to the num- 
ber of figures in the series. So the mean 
times the number is equivalent to the num- 
ber squared. This relationship is suggested 
in Fig. 2 

Consider now the Constant-difference 
rule. The arithmetical and geometrical 
relationships involved in this rule are sug- 
gested in Fig. 3 and 4. Each figure essen- 
tially represents the original series in- 
verted and added across; and it becomes 
apparent that the sum of the first and the 
last number, the sum of the second and 
the second-from-the-last number, and so 
on, is the same. The sum of the original 
series is, of course, one half the sum of the 
column of sevens formed in Fig. 3, and 
half the area of the rectangle thus formed 
in Fig. 4. 


HyPoTHESsEs 


The experiment was designed to test 
the following hypotheses: 


1. It makes no difference in terms of re- 
tention or transfer effects whether the learner 
discovers the relationships which are essen- 
tial to the understanding of a cognitive 
task independently or with external di- 
rection. 

2. It is more probable that a learner be- 
comes cognizant of the relationships which 
are essential to understanding a cognitive 
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task when his attention is directed to the 
relationships than when his attention is 
directed to the task procedure alone, or when 
he is required to learn independently. 

3. It follows from the second hypothesis 
that it is more probable that what is learned 
is remembered longer and transferred more 
effectively when the learner's attention is 
directed to the essential relationships than 
when his attention is directed to the task 
procedure alone, or when he is required to 
learn independently. 

4. In the learning of tasks which involve 
both arithmetical and geometrical relation- 
ships, the learner most probably becomes 
cognizant of the former when conventional 
Hindu-Arabic symbols are used in directing 
his learning, and of the latter when more 
nearly iconic symbols are used. 


PROCEDURE 


One group of Ss, called the “no-help” 
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group, was required to discover the rules 
without help. A second group, called the 
“direct-reference” group, was given some 
direction in the form of perceptual aids, 
such as those illustrated in Fig. 1-4 above, 
with accompanying verbal instructions 
which directed their attention to the per- 
ceptual aids. A third group, called the 
“rule-given” group, was toid the rules di- 
rectly and was given practice in applying 
them, without any reference to the arith- 
metical or geometrical relationships. The 
three procedures were called, as a group, 
the “teaching treatments.” 

In addition, there were two treatments 
called the “number treatments,” which 
consisted simply of presenting the prob- 
lems in either the more nearly iconic form, 
called the “X-form,” or the conventional 
Hindu-Arabic form, called the “A-form.” 

The three teaching treatments and the 
two number treatments were combined to 
form six treatment combinations in a 2 X 
3 factorial design. The treatment combina- 
tions were identified as follows: No help, 
A-form (No-help A); No help, X-form 
(No-help X); Direct reference, A-form 
(Dir-ref A); Direct reference, X-form 
(Dir-ref X); Rule given, A-form (Rul- 
giv A); Rule given, X-form (Rul-giv X). 

A total of 60 college student volunteers 
from E’s two sections of Educational Psy- 
chology, taught in the Spring of 1957, 
formed the original sample. The Ss were 
divided into six equal groups through the 
use of a table of random numbers. 

The groups were compared in age, sex, 
grade level, and scholastic aptitude, and 
the differences were judged to be insignifi- 
cant. However, nine of the original group 
were eliminated either because the record- 
ings were defective or because they failed 
to complete the experiment. This left a 
total of eight in each group except one 
which had 10 remaining. Two more Ss 
were eliminated from the group with 10 
Ss by the use of a table of random num- 
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bers so as to make the groups equal in 
number. The remaining 48 Ss provided 
the data which were used in the analysis. 

The procedure was to have each of the 
Ss attempt to learn the two rules to the 
point where he could verbalize the rules 
and apply them in the solution of three 
different problems in succession. Tested 
individually, the Ss were asked to “think 
aloud” during the learning period, and 
voice recordings were made. 

Immediately following the learning pe- 
riod, a test consisting of 20 problems was 
given. Then, approximately four weeks 
later, the Ss were asked to solve two sim- 
ple problems and to fill out a questionnaire 
on their process of thinking. 

The learning problems and the test 
problems were reproduced on separate 
pieces of paper or cardboard so that they 
could be presented one at a time. The 
experimental procedure may best be de- 
scribed step-wise, as follows: 

Step 1. General instructions. The follow- 
ing instructions were given to each S. 


I am going to show you a series of addi- 
tion problems one at a time, and you are 
to try to discover how to find the sum of 
each problem without adding in the usual 
manner. 


Or, in the event the rule was to be 
given, the initial statement was as follows. 


I am going to tell you how to find the 
sum of two kinds of number series without 
adding, and then I will give you some prac- 
tice in using the rules. 


The remaining general instructions were 
the same for each experimental group. 


There are two types of problems, and 
there is a different rule for each of the two 
types. 


At this point two sample problems were 
presented, as illustrated in Fig. 5. 


The first type, called the Odd-numbers 
problem, consists of a series of odd numbers 
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beginning with one. Here is an example. 
(Point to the sample problem in the A- 
form.) Below the series you are given the 
number of figures in the series. In this case, 
there are five figures, so the “number” is 
five. 

Now look at the problem on the right. 
This problem is the same as the one on the 
left except that the numbers are represented 
in rows of X’s. Each row represents a num- 
ber. Do you see? (Explained, if necessary.) 

The second type problem is called the 
Constant-difference problem. This type con- 
sists of numbers which increase by a certain 
amount. The series may begin with any 
number and be of any length, but the 
numbers will always increase by a certain 
amount. In the example, the numbers in- 
crease by one. (Point to the sample prob- 
lem in the A-form.) Again, on the right is 
the same problem in the X-form. 


From this point the instructions were 
somewhat different for each treatment 
combination. For the two no-help groups 
the instructions were as follows. 


See if you can discover the rule for the 
Odd-numbers type problem first. Keep in 
mind that you are trying to discover a way 
of finding the sum of this type problem 
without adding in the usual manner. The 
problems I show you will all be examples of 
this type problem in the X-form (or con- 
ventional form). 


At this point the first “learning” prob- 
lem was presented. The learning problems 
appeared the same as the sample problem 
in Fig. 5, but were all in the appropriate 
number form. 


I want to record what you are thinking, 
so “think aloud” as much as possible. Tell 
me what you are thinking, in other words. 


After S learned the rule to the Odd- 
numbers type problem, the Constant-dif- 
ference problem was introduced by simply 
saying, “Now let’s see if you can discover 
the rule to the Constant-difference type 
problem.” 

The specific instructions to the two di- 
rect-reference groups were as follows: 
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See if you can discover the rule for the 
Odd-numbers type problem first. Keep in 
mind that you are trying to discover a way 
of finding the sum of this type problem 
without adding in the usual manner. The 
problems I show you will all be examples 
of this type problem in the X-form (or con- 
ventional form). 

In order to help you discover the rules, 
these examples provide you with additional 
hints. I'll show you these one at a time and 
explain the hints to you. 


The first problem was presented at this 
point. Examples of the learning problems 
used with the Dir-ref X and Dir-ref A 
groups are illustrated in Fig. 1 and 2, 
respectively. 

At this point, the instructions to the 
Dir-ref A group continued as follows. 


The problem will always be inside the box, 
and the hint will be to the right of the box 
where you see the column of fours. 

Notice that there are four 4's in the col- 
umn, the same number of figures as there 
are in the problem series. Also notice that 
when you add the two columns, you get the 
same sum. Does that give you any ideas? 

I want a record of what you are thinking, 
so “think aloud” as much as possible. Tell 
me what you are thinking, in other words. 


After S learned the rule to the Odd- 
numbers type problem, the Constant-dif- 
ference prv lem was introduced in the fol- 
lowing t.*.aner: 


Now let's see if you can discover the rule 
for the Constant-difference type problem. 
Again, the problem will always be inside the 
cay and the hint will be to the right of the 

OX. 
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Essentially what we have done here is to 
take the problem series, invert it, and add 
across in the manner indicated. When we 
do this, you notice that we always get the 
same number, in this case 7. In other words, 
when you add the first number to the last 
number, you get 7; when you add the sec- 
ond number to the next-to-the-last, you get 
7, and so on. 

Also notice that the sum of the first col- 
umn to the right is the same as that of the 
problem series, and the sum of the second 
column, the column of 7’s, is twice the sum 
of the problem series. Now, does that sug- 
gest to you any ideas? 


Starting again at the point at which the 
first problem is presented, the instructions 
to the Dir-ref X group continued as fol- 
lows. 


In these examples, the lines enclosing 
some of the X’s and the dashes in a box are 
the hints. Notice that the number of dashes 
in the box is the same as the number of X's 
remaining outside. In other words, the total 
number of X’s in the problem can be re- 
arranged in the form of the box. Does this 
give you any ideas? Tell me what you are 
thinking. “Think aloud,” in other words. 


After the Odd-numbers rule was dis- 
covered, the Constant-difference problem 
was introduced as follows. 


Now let’s see if you can discover the 
rule for the Constant-difference type prob- 
lem. Again the lines and dashes are your 
hints. Notice in this case that the number 
of dashes equals the number of X’s. Essen- 
tially what we have done here is to invert 
the problem series and attach it to the origi- 
nal in the manner indicated to form the 
box. In other: words, when you add the first 
row to the last you get 7; when you add 
the second row to the next-to-the-last, you 
get 7, and so on. In the end you have twice 
as many spaces within the box as there are 
X’s in the original problem. Does that give 
you any ideas? 


For the rule-given groups, the instruc- 
tions were as follows: 


The problems I will show you first are 
all examples of the Odd-numbers type prob- 
lem in the X-form (or conventional form). 
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The rule is to square the number of figures 
in the series to get the sum. The number of 
figures is 4. Four squared, or 4 times 4, is 
16, which is the sum. 

Now I will show you some more examples 
of this type problem. See if you can apply 
the rule correctly to each. Tell me what you 
are thinking. 


After learning the Odd-numbers rule to 
the established criterion of three success- 
ful applications in succession, the Con- 
stant-difference type problem was intro- 
duced as follows. 


Now let's see if you can learn the rule 
to the Constant-difference type problem. 
The rule is to add the first and the last 
numbers in the series, multiply by the num- 
ber of figures in the series, and then divide 
by two. With the first example, add the first 
and the last numbers (2 and 5 are 7), mul- 
tiply by the number (4 times 7 is 28), and 
divide by two (28 divided by 2 is 14). Four- 
teen is the correct sum. Now you practice 
the rule on these other examples 


Step 2. The learning period. The learn- 
ing period was considered to begin imme- 
diately after the instructions in Step 1 
were given, and to end immediately after 
the S successfully applied an acceptable 
rule to three problems in succession. An 
acceptable rule was defined as one which 
could be used in the solution of the prob- 
lems as presented in the learning period, 
even though it might be inadequate for use 
with the problems as presented in the test 
period. For example, a rule which was fre- 
quently considered is to multiply the mid- 
dle number in vhe series by the number 
of figures. This was adequate during the 
learning period because the problems in- 
cluded all the numbers; however, the test 
problems included only the first three num- 
bers and the last of the series, which made 
the rule awkward to apply. This criterion 
forced F to deviate slightiy from the ex- 
perimental procedure in some cases, as will 
be described below. 

Six different problems were used in the 
learning period, and they were basically 
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the same for each group. The general pro- 
cedure was to wait until S had developed 
a possible solution with one problem, then 
to give him the next problem, telling him 
to try it on another example. This pro- 
cedure was continued until S was success- 
ful. S was permitted to manipulate the 
cards himself if he wished, to back-track, 
or to look at several problems simultane- 
ously. Also, EZ repeated the instructions 
when asked, and answered questions re- 
garding the task. Otherwise, E limited his 
remarks to those which were intended to 
give support and encouragement, such as, 
“Try your idea on the next problem,” 
“You are doing very well,” and “See if it 
works.” In order to keep S thinking aloud, 
E would ask, “What are you thinking 
now ?” whenever S was silent for more than 
10 or 15 seconds. 

In the event that S discovered a rule 
which appeared to work satisfactorily but 
which was not the intended rule, £ first 
attempted to encourage him to continue 
searching by saying, “That’s fine, now see 
if you can discover another rule.” In the 
event that S perseverated, E would shift 
to the problems used during the first test 
period, and, as soon as S became aware of 
the inadequacy of his rule, would shift 
back again to the learning problems. 

There were some individuals, particu- 
larly in the no-help groups, who were 
simply unable to discover the intended 
rules within the time period of 60 to 90 
minutes scheduled for each S. In this 
event, the learning period was terminated. 

Step 3. The first test period. Immedi- 
ately following the learning period, S was 
given 20 problems to solve, five each of 
the following: Odd-numbers problems 
(A-form); Odd-numbers problems (X- 
form); Constant-difference problems (A- 
form); Constant-difierence problems (X- 
form). 

The 10 problems of each type were ac- 
tually five problems presented once in the 
A-form and once in the X-form. The prob- 
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lems were grouped by type and numeri- 
cal form and were clearly identified as 
such. The procedure was to present the 
Odd-numbers problems first, then the Con- 
stant-difference problems. Furthermore, 
the Ss who learned the rules using prob- 
lems in the A-form were presented the test 
problems in the A-form first. 

Step 4. The retest period. A retest was 
given to all Ss four to six weeks after the 
first test. The second test consisted of two 
problems which could most easily be 
solved by using the two rules which the 
Ss attempted to learn during the learning 
period, and a series of questions on their 
process of thinking. The problems and 
questions were completed under super- 
vision, all in one day. The instructions 
were to record all seratchwork in spaces 
provided for this purpose, and to write 
the answers to the questions as clearly and 
completely as possible. The problems were 
as follaws: (a) What is the sum of the 
first 35 odd numbers, and (6) what is the 
sum of all the first 35 numbers? The ques- 
tions which followed each problem were 
the following: 


Did you add the first 35 (odd) numbers 
to get your answer? (Yes or no) If your 
answer is no, explain how you obtained your 
answer. 

Did you try to recall the rule you learned 
(or attempted to learn) under our direc- 
tion several weeks ago? (Yes or no) 

Were you successful in recalling the rule? 
(Yes or no) 

Describe how you recalled or attempted 
to recall the rule. 


ANALYSIS OF THE RESULTS 


Not all Ss succeeded in learning the two 
rules as stated above to the established 
criterion. Some of those who failed did 
manage to discover other workable varia- 
tions of the rules which were judged to be 
acceptable. Others, however, failed to learn 
any workable rule for one of the two tasks 
before the practical limitations of time 
forced E to terminate the learning period. 
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TABLE 1 


NuMBER Wuo LEARNED Eacu RULE (OR AN 
AccEPTABLE VARIATION) TO CRITERION 
PERIOD 


Rul-giv All 


treat- 
ments 
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Odd-n s rul 
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1. 4n(a + 1) 
2. mn 

3. none 

All rules 








*In this column, n = number of figures in series; 
m = mean, or median of series; a = first number, and 
1 = last number in series. 


Table 1 shows the number of those who 
learned the rules and each of the accept- 
able variations, and those who failed to 
learn any workable rule in each case. 

Those who failed to learn an acceptable 
rule were retained in the experiment and 
retested along with the others four weeks 
later, in anticipation of the possibility 
that their performance on the retest prob- 
lems and their responses to the retest 
questionnaire would reveal useful informa- 
tion for further research. This indeed 
proved to be the case. 

The purpose of the 20-item test given 
immediately after the learning period was 
to detect differences among the Ss in their 
achievement, if any. All Ss who learned 
the stated rules to criterion seored per- 
fectly on the test, and those who failed to 
learn the acceptable rules were, of course, 
unable to solve the problems without 
adding. 

On the retest given four weeks later, the 
datum of primary interest was the method 
used in solving the problems, not whether 
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or not the correct answer was provided. 
Table 2 shows the methods used on the re- 
test and the number of Ss in each group 
who used each method. 

A comparison of Table 2 with Table 1 
reveals the changes in the learned methods 
and the methods used on the retest, but 
does not indicate whether or not the meth- 
ods that were used on the retest were the 
same as those which were developed dur- 
ing the learning period. It is apparent that 
there were at least three possibilities: (a) 
the same method was used, (b) a differ- 
ent method was used, or (c) the problem 
was solved by simple addition, which 
would indicate complete failure to recall 
or transfer the rule if learned. Table 3 
shows the number of Ss in each of the 
teaching treatment groups who, on the re- 
test, used the same method, some other 
method than that which they had learned, 
or simple addition. The Ss who failed to 
learn any acceptable rule in the learning 
period were placed in the “added” cate- 
gory if they added on the retest, and in 
the “other” category if they attempted 
some other procedure on the retest. 

The results presented above fail to sup- 
port the hypothesis that the Ss whose at- 
tention is directed to the relationships 
which are essential to understanding re- 
member their learning longer and transfer 
it more effectively than do the Ss of the 
other two treatment groups (Hypothesis 
3, above). The prediction was that the di- 
rect-reference group would be superior. In- 
stead, although the obtained differences 
are not highly reliable, they are consistent 
with previously published data to the ex- 
tent that they suggest that the independ- 
ent discovery procedure is superior to 
learning by rote. 

The reader’s attention is directed to the 
fact that although 13 Ss in the no-help 
groups failed to learn an acceptable rule 
for the Constant-difference problem dur- 
ing the learning period, there were only 
four who added on the retest, and 10 who 
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TABLE 2 
MeTHops Usep ON THE RETEST 


| | ° 
Noelp | Dire | nsteiy | Statues | ‘einen 
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Method* — —|- — Pw Pa — 
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number in series. 

> Frequencies above and below line combined for chi-square analysis 

* Not significant at the 0.05 level 

** Significant at the 0.05 level 


TABLE 3 


CORRESPONDENCE OF MetHops Usep on THE Retest with THose 
LEARNED DvurRING THE LEARNING PERIOD 


All number | All teaching 
treatments | treatments 
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TABLE 4 
NuMBER WHO WERE JUDGED TO BE CoG- 
NIZANT OF Eacu Type or RELATIONSHIP 
DuRING THE LEARNING 
PERIOD 


Type | 
Relationship 


— x} A 
od 


)dd- atin rule 


3 
it 


| ¢ 
Arithmetical 1/1 1/0/90 
Geometrical 0 
Both | 0 
Neither 7 


Arithmetical | 

Geometrical }0'0); 2 
Both | 0 
Neither | 6 | | 36 





used acceptable methods. In the case of 
the other two teaching treatment groups, 
the number who added increased, and the 
number who used acceptable rules de- 
creased rather markedly. This finding sug- 
gests that as a result of their experience 
during the learning period, the Ss in the 
no-help groups were motivated to continue 
learning afterwards and those treated 
otherwise were not. 

The superiority of the discovery pro- 
cedure may be better explained in terms 
of motivation, then, than in terms of un- 
derstanding. This suggestion is given added 
support by the following data. 

In an attempt to determine the num- 
ber of Ss in each experimental group who 
actually became cognizant of the arith- 
metical, the geometrical, or both types of 
relationship, the typed transcriptions of 
the voice recordings made during the 
learning period were carefully examined. 
Any S was judged to be cognizant of a 
relationship if he verbalized it at any point 
in the process of his learning. As a result 
of E’s continued efforts to stimulate each 
S to think aloud during the learning pe- 
riod, very complete records were obtained. 
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In addition, each S who failed to volunteer 
an explanation of the rule was asked to 
do so at the end of the period. In most 
cases of the latter type, the S’s response 
was that he could not provide an explana- 
tion. Table 4 shows the results of this 
analysis. 

Most striking is the fact that only one- 
quarter of the entire group of 48 Ss was 
judged to be cognizant of one or both 
types of relationship to each of the two 
rules. The small number of Ss precludes 
any statistical check of the reliability of 
the obtained difference in the experi- 
mental groups. However, accepting the 
obtained data as reliable, the results 
pear to support the prediction that the 
direct-reference treatment would produce 
the greatest incidence of understanding. 
The next highest frequency of understand- 
ing was in the no-help group, and even in 
the rule-given group there was at least 
one S who understood the odd-numbers 
rule. Furthermore, with only two excep- 
tions, both in the no-help group, the type 
of relationship discovered corresponded to 
the type that was predicted from the num- 
ber treatment. When the problems were 
presented in the A-form, the arithmetical 
relationship was discovered, and when the 
X-form was used, the geometrical rela- 
tionship was discovered. The data, there- 
fore, tend to support Hypotheses 2 and 4. 

The data presented in Table 4 do not 
preclude the possibility, however, that the 
Ss who were not cognizant of the relation- 
ships at the end of the learning period 
subsequently did become cognizant of 
them. The most adequate test of the effi- 
cacy of understanding in learring is to de- 
termine how those who actually verbalized 
the relationships performed on the retest 
problems. Table 5 shows the number of 
those who, on the retest, used the same 
rule they learned, those who simply added, 
and those who used some other method 
than that learned. 

Again, the dearth of data does not per- 
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LEARNING BY_ INDEPENDENT DISCOVERY 


TABLE 5 
Mertuops Usep on RETEST oF THOSE WHO 
WERE JUDGED TO BE COGNIZANT OF THE 
EssENTIAL RELATIONSHIPS 


| No-help 


Dir-ref | Rul-giv 
| All 


} 
Groups 


Methods _ 
A|X | A|X|A/{X 
Problem 1 

Same Rule 1/3 | 0 2/0 | 
Added 10';0/;0/;1)/ 0) 
Others 0/;0;1/2/0} 
All Methods | 1/3/1|5| 0 
Problem 2 
24/113;0;0] 
1/0 2/|0 


Same Rule 
Added } 1/0) 

Others 0'0/;3 0)'0 
All Methods 31:1/)6 2/0 


mit a conclusive test of the first hypothesis. 
However, since only about one-half of 
those who learned acceptable rules with 
understanding during the learning period 
succeeded in retaining and transferring 
their learning four weeks later, the data 
do not offer very conclusive support of 
the meaning theory. 


Discussion 


Perhaps the data which have been pre- 
sented are sufficient to suggest the inade- 
quacy of the “meaning” explanation, but 
they fail to communicate the impression 
of markedly increased motivation or inter- 
est in the task which characterized many 
of the Ss in the no-help group as com- 
pared with many of those in the other 
groups. The difference in motivation was 
reflected in their written comments on the 
retest and verbal reports to E. For ex- 
ample, one S in the no-help group re- 
ported that he was so intrigued with his 
suecess in discovering the rules that he 
told his friends of his experience and 
tested their ability. Others in the no-help 
group who failed to discover the rule told 
of their efforts to learn the rule, even go- 
ing so far as to look up the algebraic 
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formula in the library. On the other hand, 
one S in the rule-given group complained 
that E had not instructed him to remem- 
ber the rules, so he had promptly forgot- 
ten them. Others in the rule-given and 
direct-reference groups commented on 
their confusion during the learning period 
in explaining their inability to recall the 
rules. The evaluation of the Ss’ comments 
was admittedly highly subjective, but it 
may better suggest the nature of the mo- 
tivation which is offered as the more ade- 
quate explanation for the superiority of the 
discovery procedure. 

Of the various descriptive concepts of 
human motivation known to this present 
writer, Allport’s concept of functional 
autonomy describes the motivation 
that was developed by those in the no-help 
group. Whereas, at the beginning of the 
learning period, all the participants were 
assumed to be alike in that they were mo- 
tivated primarily by such extrinsic factors 
as the approval of F, at the end of the 
learning period, the motivation of many 
of those in the no-help group appeared to 
be independent of the experimental or in- 
structional situation. Presumably, the mo- 
tivating power is of the type that lies in 
acquired interest or ego involvement in a 
task, and dvelops to the extent that the 
individual relies on his own cognitive ca- 
pacities in learning. 

The findings place the teacher in some- 
what of a dilemma. He must decide which 
is the more important outcome of learn- 
ing, (a2) maximum understanding, or (}) 
maximum motivation to continue learning. 
However, further research on the problem 
may reveal ways of directing the discovery 
process without inhibiting the develop- 
ment of autonomous motivation. Teachers 
should continue to strive to guide the 
learning of their students but should re- 
frain from giving answers directly. 

Finally, the findings suggest that more 
consideration be given to the influence of 
the stimulus materials in directing the 
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thought processes of the learner. Particu- 
larly recommended is a careful study of 
the educational “toys” that are used in 


some primary classrooms to develop num- 
ber concepts, and the practice exercises in 
workbooks and on worksheets which are 
intended to direct the learner’s attention 
to relationships. 


SuMMARY AND CONCLUSIONS 


An experiment was performed with col- 
lege students in order to test the premise 
that learning by independent discovery 
is superior to learning with direction be- 
cause the learning is more meaningful in 
the former case. “Meaningful learning” 
was used in the cognitive sense of under- 
standing or organization. The Ss were re- 
quired to learn two arithmetical tasks to 
a common criterion. A two-factorial de- 
sign was used which involved three “teach- 
ing” treatments (no-help, direct-reference, 
and rule-given), and two “number” treat- 
ments (A-form and X-form). Approxi- 
mately four weeks later a retest was ad- 
ministered which was designed to test the 
ability to recall and apply the learned rules 
to a somewhat different type of problem 
from that used in the learning period. The 
Ss’ self-reports of their learning process 
and methods used on the retest were ana- 
lyzed. 

The conclusion is that the superiority of 
the discovery procedure of learning over 
procedures of learning with external di- 
rection is not adequately explained in 
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terms of “meaningful learning.” The re- 
sults of this experiment suggest that when 
the learner is forced to rely on his own 
cognitive capacities, it is more likely that 
he will become motivated to continue the 
learning process or to continue practicing 
the task after the lerrning period. Conse- 
quently, the learning becomes more per- 
manent and is more effectively transferred 
than when the learner is not so motivated. 
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